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Preface 


As a result of a ubiquitous society, displays for electronic appliances such 
as mobile computers and mobile phones demand high efficiency. High 
efficiency electronics were realized with the advent of nanoscale devices. 
Especially for nanoscale devices, the importance of the chemical-mechan- 
ical planarization (CMP) process emerged to achieve the integration 
and multilevel interconnections. The CMP process supports increasing 
capacity. 

CMP technology became absolutely necessary for production processes 
of the next generation of semiconductor. Pursuing global planarization, 
planarization using CMP could not be against the trend. Therefore, it is 
used for production of device makers, which is dielectric CMP, shallow 
trench isolation CMP, and metal CMP. Chapters 2 through 4, respectively, 
discuss these processes. The nanotopography of the surface of silicon 
wafers has recently become an important issue because it may seriously 
affect the post-CMP uniformity of thickness variation of dielectrics. For 
this reason, Chapter 5 explains the importance of nanotopography. 

However, CMP processing has faced a new aspect: design rules below 
50 nm. Because new structures and new materials are used for improving 
the performance of devices, the existing CMP slurry is facing its limits. 
Chapter 6 provides novel CMP slurry for application to memory devices 
beyond 50 nm technology. 

The authors are especially grateful to Allison Shatkin and Amy Blalock 
at CRC Press for their valuable guidance. 
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Overview of CMP Technology 


1.1 Motivation and Background 


Since Bardeen, Brattain, and Shockley of Bell Laboratories invented the 
transistor in the 1940s, semiconductor integrated circuit (IC) technology 
has been remarkably developed. The improvement of operation capacity 
and speed, thanks to the development of semiconductor technology, is 
playing a key role in the rapid progress of current scientific technologies. 
In the flow of rapid progress, it was required for semiconductors to possess 
super high speed, capacity, and performance, and as a result, integration 
of the transistor is increasing. Since the beginning of the 21st century, the 
design rule of semiconductors was set below 100 nm. Samsung Electronics 
developed 64 Gb NAND flash memory using 30 nm technology in 2007, 
Hynix developed 2G DRAM2 using 66 nm technology, and Intel devel- 
oped the Core™2 Quad Processor using 65 nm technology. Although 
these remarkable developments pursue a miniature through vertical high 
integrated circuit, it is also possible to obtain super high speed and capac- 
ity through a horizontal, high integrated circuit (multiple metal lines). 
In multilayer metal lines process technology, it is difficult to focus and 
impossible to form minute structures when the dielectric layer and metal 
line have rugged surfaces. Therefore, planarization processing was neces- 
sary to ensure lithographic depth of focus (DOF), which was considered 
to be the most important factor. A variety of planarization methods of 
high degree were indispensable in using a new material and transforma- 
tion from two-dimensional flat structures to three-dimensional multilayer 
structures into a wide and high integrated ultra large scale integrated 
(ULSI) circuit device with a diameter from 200 mm to 300 mm of silicon 
wafer. By using preexisting borophosphosilicate glass (BPSG) deposition 
and planarization methods such as reflow, spin on glass (SOG), and spin 
coating, it may soften rugged surfaces to some degree during dielectric 
layering (Table 1.1). However, problems such as aggravation of formation, 
position precision, rugged surface device according to multilayer metal 
line, and three-dimensional structures would occur (Figure 1.1). 
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TABLE 1.1 
Types and Features of Existing Planarization 
Types Examples of Method Features 
Etching Sputter, RIE, plasma etching Easy process, difficult to control 
etching 
Deposition Bias sputter, Bias ECR, plasma Damage concern, too much dust 


chemical vapor deposition 
(CVD), RF plasma CVD 


Reflow floating Reflow, SOG (spin on glass) Easy to utilize, discontinuity, 
instability, establishment of 
migration 

Selective Selective CVD, selective Possible to fill only the required 

growth epitaxial growth part (hole), low selective 


growth control, instability 


(a FIGURE 1.1 Formation of planarization of semicon- 
= p U r eee 
E EE Ez] ductor metal line. 
(b 
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(c 
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To solve this problem, Kaufmann, from IBM, developed in the 1980s 
chemical-mechanical planarization (CMP), a new global planarization 
technique that combines mechanical polishing with chemical polishing 
into a manufacturing process. This was the origin of CMP process in the 
modern sense (Figure 1.2). CMP chemically or mechanically polishes the 
semiconductor's surface for planarization. The chemical action affects 
productivity and polishing selectivity rate, whereas the mechanical action 
contributes to the smoothness of the surface. As in Figure 1.1c, the CMP 
process eliminated the rugged upper part of a surface in an orderly man- 
ner, regardless of the low area condition. CMP made global planarization 
possible without unevenness and solved the problem of existing tech- 
niques that caused deteriorated layers to form and lowered shape preci- 
sion. For this reason, CMP technology became absolutely necessary for 
the production process for the next generation of semiconductors and it 
is used for production of device makers. Internationally, many businesses 
and lab organizations are actively making progress with research into a 
new process technology. 

This CMP propagation and active research progress may well be the first 
departure of shallow trench isolation (STI) CMP. STI CMP was introduced 
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FIGURE 1.2 Global planarization by CMP. 


by replacing local oxidation of silicon (LOCOS) with STI by the splitting 
method for each transistor. For the process of complementary metal-oxide 
semiconductor (CMOS), each transistor used LOCOS because separation 
by electricity was needed to eliminate a short. Unfortunately, this LOCOS 
process caused a severe problem with device integration because its design 
rule entered under 0.5 um, making a sharp edge called bird's beak. To 
solve this problem, STI led to an increase in a very tiny active area and in 
device packing density. Despite the advantages of STI, it cannot be formed 
without the CMP process because to form STI, polishing must be stopped 
by eliminating gap filling oxide film at the Si,N, layer. This process is only 
possible through STI CMP technology. Currently, manufactured semicon- 
ductor devices on the basis of CMOS are produced applying STI CMP. 


1.2 The Key Factors of CMP Process 


The characteristics of CMP are material removal rate (MRR), thickness 
uniformity, and surface quality, and they are directly related to device 
characteristics and productivity. These characteristics are determined by 
each factor as per Figure 1.3. 


1.2.1 CMP Polishing Machines 


Whereas wafer polishing machines polish dozens of micrometers (um), 
CMP polishing machines polish 0.5 ~ 1.0 um of target film. Uniformity 
is extremely important after polishing. The features of CMP polishing 
machines include automation, high precision, reproducibility, and control 
of process parameter. Polishing machines are largely divided into rotary 
type, orbital type, linear type, and fixed table type according to the move- 
ment of the wafer carrier and table (Table 1.2). 

A schematic diagram of the Mirra polisher, currently used in a device 
maker, is shown in Figure 1.4. In the cassette load, the sensor perceives the 
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FIGURE 1.3 The elements (factors) of CMP process. 


TABLE 1.2 
Motion of Wafer Carrier and Table by CMP Tool Type 
Motion 

CMP Tool Type Wafer Carrier Table (Platen) 
Rotary Rotate Rotate 
Orbital Rotate Orbital path 
Linear Linear path Linear path 
Fixed table Orbital path Stationary 


wafer, and the FABS robot moves to the transfer station. CMP is transferred 
to the head clean load unload (HCLU) where CMP is actually formed after 
a long robot arm absorbs the wafer by using a void space. The FAB robot 
arm transfers the finished CMP processed wafer to the input station for 
cleaning in the HCLU. 


1.2.2 Slurry for CMP 


As mentioned earlier, the core of CMP technology is slurry for CMP based 
on nanotechnology, even though the importance of CMP technology was 
highlighted before the introduction of a nano process for semiconductor 
processing. Slurry is composed of water, polishing particle, alkali, inor- 
ganic salts, and organic compound, and specific slurry is manufactured 
through appropriate selection of the components. 

Some of the most important characteristics of slurry are equal disper- 
sion of polished particle caused scratch on the surface of wafers, mini- 
mizing of metal current, optional polishing characteristics, viscosity, and 
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FIGURE 1.4 Mirra schematic. 


storage safety that cause polysilicon abnormal resistance and metal line 
short. Commonly used solvent is highly pure, from which most of the 
impurities such as ion, small particles, microorganisms, and organic sub- 
stance are eliminated, with a 5 - 18 MOcm resistance rate. The appropriate 
slurry is applied according to objects to be polished. Table 1.3 summarizes 
the slurry required for various processes. 
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TABLE 1.3 
CMP Slurry Type 
Polishing Objects Purpose of Application Slurry Types 

SiO, Planarization, trend 
FSG favoring introduction of Fumed silica is mainly 

Interlayer dielectric processing below 0.35 um used, tendency of 
DES. switching to ceria 
Low-k 


STI (shallow trench isolation) 


Intra-semiconductor 
devices separation layer 
elimination; applied 
below 0.25 um (128, 
256M) 


Ceria, fumed /colloidal 
silica 


Formation of trench- 


Polysilicon shaped capacitor, Fumed /colloidal silica 
polysilicon is eliminated 
Wire formation (W is Fumed silica is mainly 
W eliminated) used; alumina 
development phase 
Al Contact plug Tormiatior Fumed silica, alumina 
Metal (Al elimination) 
Wiring and plug A Fumed/colloidal silica, 
simultaneous formation : 
Cu . alumina, MnO,, and 
(dual damascene: Cu is 
soe others 
eliminated) 
1.2.3 Pad 


Generally, polymer of polyurethane type is used for pads. Uniform surface 
roughness and porosity of pad influence the characteristics of WIWNU 
(within-wafer non-uniformity), WIWNU (wafer-to-wafer non-unifor- 
mity), and LTLNU (lot-to-lot non-uniformity). For this reason, chemical 
technology durability, hydrophilic, and viscoelastic features differ accord- 
ing to each required CMP process condition. Table 1.4 shows currently 
used pads types and the CMP process to which they are applied. 


1.2.4 Slurry Supply Equipment and Filtering Equipment 


The CMP process has a higher possibility of defects than other processes 
because it uses abrasive in slurry. It especially causes scratches; therefore, 
controlling the defects is important. To repress scratches attributed to 
slurry, filter is generally placed at the supply system, circulation loop, and 
point of use (POU). These factors can be mixed diversely according to the 
polishing machine’s structure or processing condition selection. However, 
other materials are also influenced because of the correlation when a factor 
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TABLE 1.4 
Typical Applications for Different Pad Types 
Type 1 Type 2 Type 3 Type 4 
Structure Felt fibers Porous film Microporous Nonporous 
impregnated coated ona polymer sheet polymer sheet 
with supporting with surface 
polymeric substance macrotexture 
binder 
Pad examples Pellon™, Polytex™, IC1000™, OXP3000™, 
Suba™ Surfin™, IC1010™, IC2000™ 
UR100™, 1C1400™, 
WWP3000™ FX9™, MH™ 
Typical Si stock polish, Si final polish, Si stock, ILD ILD CMP, STI, 
applications tungsten CMP tungsten CMP, CMP, STI, metal dual 
post-CMP buff metal damascene 
damascene 
CMP 


changes. Therefore, each CMP process should be controlled appropriately 
because polishing target film and processing can be changed. 

All CMP processes applied to semiconductor manufacturing processes, 
including STI CMP, are formed around the CMP machine. The surface of 
the wafer and a pad are contacted by pressure of its own load of a head 
part. At this time, a pad attached to the polishing table makes a simple 
rotary movement, and a head makes a rotary movement and shaking 
movement at the same time (Figure 1.5). The wafer exerts a regular pres- 
sure on the polishing table. Consumables are liquid slurry, a pad, and a 
cleaner and others conformable to each target substance. At this point, 
abrasive of the inside of slurry and the wafer device flow into interface 


Insert Film 


\ E Carrier 
N 1 
N 7 


Polishing Pad 


Platen 


I 


Y y 
Slurry 


FIGURE 1.5 (See color insert) CMP process of manufacturing. 
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space to make contact at the overhang. Pressure is concentrated at this 
protuberant part. Therefore, it has a relatively higher speed of surface 
elimination than other parts. Also, the protuberant part is equally elimi- 
nated over the surface while processing is in progress. 

Except for STI CMP, 50 nm processing; technology CMP can be divided 
into interlayer dielectric (ILD) CMP and metal CMP. Chapters 2 and 6, 
respectively, will discuss these two processes. The 50 nm CMP process- 
ing rate rapidly augments through multiple metal lines structure for high 
integrated circuits and is recognized as a necessary process for formation 
of various detail patterns. The existing CMP slurry is becoming limited to 
future applications because the device design rule is going below the 50 nm 
level, and new materials and structures are emerging. In addition, struc- 
tures that can be embodied only through CMP processing are appearing. 

In the case of dynamic random access memory (DRAM), the top elec- 
trode used in capacitor for a device's high speed raises the necessity of 
noble metals like ruthenium (Ru), platinum (Pt), and iridium (Ir), which 
have low electric resistance and are mechanically and thermally stable. 
These noble metals are also chemically very stable and it is not easy to 
form capacitors by the etch back process. That is why noble metal CMP is 
compulsory. However, Ru is divided during the CMP process as a conse- 
quence of poor adhesion of leakage of cap oxide, grain growth of Ru, and 
cap oxide. To protect this phenomenon, the application of new functional 
slurry is essential. 

NAND flash memory started to apply floating gate to increase the capac- 
ity of a device from 65 nano processing technology. After STI CMP process- 
ing in the gate formation area, silicon nitride is stripped. After Si is placed 
between the device manufacture areas using self-aligned poly (SAP) method, 
polysilicon floating gate is formed through CMP process. The poly gate iso- 
lation process using CMP raises many problems in applying the existing 
CMP slurry because of the soft characteristic of poly Si and polymeric reac- 
tion, despite simple processes like planarization after deposition. 

Manufacturing of smaller devices necessitates the introduction of new 
materials and processes. The role of CMP is expanding and its importance 
is also being augmented. As a result, slurry production companies and 
laboratories are actively processing developments and researching con- 
sumable and optimized processing. 


Z 
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2.1 Interlayer Dielectric (ILD) CMP Process 


In the deposition of the interlayer dielectric (ILD) film to break off relations 
wiring and wiring, step height is formed because the deposition aspect 
becomes different along the shape of the lower part pattern. Without 
removing induced step height in the wiring process, the limit of expo- 
sure is caused passing over the depth of focus (DOF) margin during the 
lithography process. Therefore, the global planarization process is essen- 
tial after each layer is insulated, and this CMP process is designated as 
ILD CMP or intermetal dielectric (IMD) CMP. The ILD CMP process has 
been used to polish plasma-enhanced tetraethylorthosilicate (PETEOS) 
or high-density plasma chemical vapor deposition (HDPCVD) film on 
deposited silicon wafers. Figure 2.1 shows the ILD CMP process. As the 
manufacturing technique of the semiconductor device is developed, the 
number of levels in an interconnect technology is increased. To obtain 
the multilevel interconnection, the surface of wafer must be planarized to 
prevent topography roughness from growing with each level as shown in 
Table 2.1. 


2.2 Rheological and Electrokinetic Behavior of 
Nano Fumed Silica Particle for ILD CMP 


In the ILD CMP process, the most important factor is the characteristics 
of nano fumed silica slurry. The chemical interactions and physical prop- 
erties of nano ceramic particulates must be considered to planarize the 
surface of wafer successfully. The dispersion stability of nano fumed silica 
slurry is directly related to the polishing rate (removal rate), the surface 
scratch, and the uniformity (within-wafer non-uniformity) of wafer sur- 
face across the whole wafer. Controlling the dispersion stability of nano 
fumed silica slurry is a key parameter in the ILD CMP process. 


10 Nanoparticle Engineering for Chemical- Mechanical Planarization 


2 

PETEO . PETEO Z 1.3 um 
Ti 0.25 um Fi 0.25 um 
Al-0.5%Cu 0.75 um Al-0.5%Cu 0.75 um 
x 0.25 um < 0.25 um 

PETE T E PETE E . 
O 0.8 um CMP O 0.8 um 
Before ILD CM After ILD CMP 


FIGURE 2.1 (See color insert) Schematic of ILD CMP process. 


2.2.1 The Unique Behavior of Concentrated 
Nano Fumed Silica Hydrosols 


Nanosize inorganic particles (i.e., below 100 nm) are gradually being incor- 
porated into a broad range of advanced devices and applications. Some 
examples include silicon (Si) wafer polishing, planarization for semicon- 
ductor manufacturing (CMP process), electronic packages, ultrathin-film 
optical devices, advanced fuel cell catalysts, molecular conductors, and bio- 
chips. Recent evidence has indicated that classical colloid principles might 
not fully explain the complex behavior of concentrated nanosols. 

According to the Derjaguin-—Landau-Verwey—Overbeek (DLVO) theory, 
a cornerstone of modern colloid science, two types of forces exist between 
colloidal particles suspended in a dielectric medium: (1) electrostatic 
forces, which result from unscreened surface charge on the particle; and 
(2) London-van der Waals attractive forces, which are universal in nature. 
The colloidal stability and rheology of oxide suspensions, in the absence 
of steric additives, can be largely understood by combining these two 
forces (assumption of additivity). 

There are several reports of the unique stability of nanosize silica hydro- 
sols near the isoelectric point (IEP). The Canberra group experimentally 
discovered the existence of short-range forces that play an important 
role in the interaction process and must be added to those forces already 
accounted for by the original DLVO theory. These short-range interactions 


TABLE 2.1 
Roadmap for ILD CMP 

Year of Production 2006 2007 2008 2009 2010 2011 
DRAM % pitch (nm) 70 65 57 50 45 40 
Flash % pitch (nm) 64 57 51 45 40 36 
CMP Performance 
Dishing (A) <500 <400 <300 
Erosion (A) <1000 <500 <300 
Uniformity 5% 3% 2% 


Defect (um) <0.13 <0.10 <0.08 
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are referred to as structural forces. Structural forces might explain some 
particular aspects of the stability behavior of silica nanosols, but they are 
insufficient to account for the apparent cooperative effects of solids load- 
ing and electrostatic found in the present study. Contrary to suspensions 
based on colloidal-size (100-1000 nm) silica and other inorganic oxides as 
reported in the literature, we found that the rheological behavior of concen- 
trated electrostatically stabilized silica nanosols is counterintuitive with 
regards to the predictions based on a standard interpretation of DLVO 
theory. Despite the high surface charge density electrokinetic potential at 
pH 8, nano fumed silica particles not only showed unstable rheological 
behavior that would normally indicate an unstable or aggregated suspen- 
sion (i.e., pseudoplastic high viscosity), but the rheology did not have the 
expected dependence on ionic strength. In this chapter, experimental mea- 
surements, DLVO calculations, and simple geometric considerations are 
used to understand the influence of solids loading and the electrical double 
layer on the rheological behavior of concentrated silica (20 nm) nanosols, 
and to compare their behavior with that of much larger silica microspheres, 
as well as like-sized nano-alumina, under similar conditions. 


2.2.2 Electrokinetic Behavior of Nano Silica Hydrosols 


By changing the pH, one can alter the magnitude (and sign) of the zeta 
(G) potential, whereas the addition of an inert electrolyte will affect both 
the magnitude of ¢ and the electrical double-layer thickness. Thus, both 
pH and electrolyte concentration will directly impact colloidal stability 
in an electrostatically stabilized system. Figure 2.2 compares £ potential 
and viscosity (at a shear rate of 26.4 s) as a function of pH for the nano- 
sized fumed silica and the silica microspheres suspensions. The average 
primary particle sizes were 20 nm for the nanosized fumed silica and 500 
nm for silica microspheres. Even at a solids concentration of 20%, the silica 
microspheres exhibit a fairly constant and low viscosity across the entire 
pH range, whereas nanosized fumed silica exhibits a strong pH depen- 
dence at a volume fraction of 13.2% with an increase in viscosity near pH 
7 in excess of 300mPa s. Figure 2.3 shows the effect of inert electrolyte 
concentration on viscosity as a function of shear rate for highly charged 
13.2% nanosized fumed silica at pH 8. 

Figure 2.2 indicates that for the silica microspheres, the ¢ potential and 
viscosity both follow the expected behavior predicted by the classical 
DLVO theory. On the other hand, the nanosized fumed silica exhibits a 
discrepancy between the expectation of DLVO theory and the experimen- 
tal results, that is, as ( of the nanosized fumed silica increases, viscos- 
ity sharply increases. Hence, factors such as particle crowding, particle 
ordering, and electroviscous effects will also impact viscosity, in addition 
to aggregate or network formation. 
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FIGURE 2.2 The relationship between zeta potential (open shapes) and viscosity (filled 
shapes) for silica suspensions as a function of suspension pH: nanosize A90 versus Geltech 
microspheres (G). Viscosity was determined at a shear rate of 26.4 s. Particle volume frac- 
tion given in percent. 
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FIGURE 2.3 The effect of electrolyte concentration on the viscosity of 13.2% A90 silica at pH 
as a function of shear rate. 


2.2.3 Geometric Consideration 


To more properly analyze the results of Figures 2.2 and 2.3, it helps to first 
layout the physical dimensions of the system as depicted in Figure 2.4. The 
mean interparticle center-to-center separation distance (d..) is defined as 
d,/013, where d, is the primary particle diameter and © is the particle 
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d; ~ dp ~ Fee = 4 deze ~ 20 nm 


FIGURE 2.4 Diagram illustrating the relationship between average interparticle surface- 
to-surface separation distance, d,, and other system dimensions, for a particle diameter d; 
= 20 nm and O = 13.2%. 


volume fraction. Then the mean interparticle surface-to-surface separa- 
tion distance (d,) is d.,.—d,. As ® increases, the system dimensions, d, and 
dy eventually become of comparable length (d,/d, ~ 1), which can lead to 
constrained motion and excluded volume effects. That is, other particles 
may be excluded from the interparticle space once the average separation 
distance is of the order of the particle size, thereby reducing the number of 
possible positions each particle is able to sample during Brownian motion. 
Furthermore, each particle with a surrounding volume of liquid defines 
a spherical cell. Figure 2.5 shows the average cell radius, reey = d.,./2, and 
d,as a function of ® and d,. As d, decreases or ® increases, d, becomes 
smaller. This has important implications for nanosize particles, and helps 
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FIGURE 2.5 Calculated average cell radius (opened symbols) and surface-to-surface sepa- 
ration distance (filled symbols) as a function of particle volume fraction and particle size 
for silica. 
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explain why it is so difficult to obtain low-viscosity concentrated nanosols 
in aqueous systems. This explanation may not be immediately obvious 
since the critical ® corresponding to d,/d, = 1 occurs at about 13%, irre- 
spective of particle size. 

However, the distance over which hydrodynamic and electrostatic 
forces act in solution is more or less independent of particle size at first 
approximation. As a result, when the average separation distance between 
particles is rather large, these forces dissipate before they can influence 
neighboring particles. As a result, particle motion is independent and the 
rheological behavior is Newtonian so long as the particles remain stable 
and do not aggregate. On the other hand, as the average separation dis- 
tance is reduced, these forces begin to influence nearest neighbors, and the 
motion of nearby particles becomes coupled. Coupling leads to an increase 
in suspension structure, which provides an additional mechanism for vis- 
cous dissipation. In aqueous nanosols, the effects of electrostatic forces on 
structure can be particularly strong as d, and d, approach the length scale 
over which short-range repulsive interactions are active. 


2.3 Particle Engineering for Improvement 
of CMP Performance 


ILD CMP typically uses a fumed silica slurry dispersed in an aqueous 
medium at pH near 11. Fumed silica is a widely adapted abrasive for ILD 
CMP because of its inexpensive price, high purity, and colloidal stability. 
However, fumed silica is difficult to disperse in an aqueous system, and 
to control powder processing, because of a large specific surface area of 
90 + 15 m?/g, making it very reactive. ILD CMP slurry was prepared at 
pH 11 to accelerate the chemical attack on the deposited PETEOS film on 
the wafer surface. But silica particles dispersed in aqueous media are par- 
tially dissolved at pH 11. Consequently, the removal rate decreased and 
microscratches were generated on the wafer surface as due to agglomera- 
tion of silica particles as surface potentials decreased. 


2.3.1 Surface Modification of Silica Particle 


As mentioned earlier, the dispersion stability of the slurry is directly 
related to CMP performance as removal rate—within-wafer non-uni- 
formity (WIWNU), which is defined as the standard deviation divided 
by the average of remaining thickness after CMP, microscratch, and the 
remaining particle on the wafer. The agglomeration of particles causes low 
removal rate and the remaining particles of the deposited film surfaces. To 
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TABLE 2.2 
Dissolution Amount of Si Ions with and without Surface Modification 
With Modification Without Modification 
2,050 ppm 40,000 ppm 
Amounts 
0.070 + 0.001 mol/L 1.370 + 0.002 mol/L 


avoid poor CMP performance, the dispersion stability of the slurry must 
be controlled by prevention of Si ion dissolution from SiO, film surface. 
The amount of Si ions dissolution is found to be dependent on surface 
modification, which was confirmed by inductively coupled plasma-atomic 
emission spectrometer (ICP-AES) analysis. Table 2.2 shows the dissolution 
amount of Si ions with and without surface modification of fumed silica 
slurry. Without surface modification, the amount of Si dissolution was 1.370 
+ 0.002 mol/L, whereas surfaces modified with poly(vinylpyrrolidone) 
(PVP) polymer yielded a dissolution of 0.070 + 0.001 mol/L, almost 20 
times less than the unmodified surface. Figure 2.6 represents the electro- 
kinetic behavior of silica characterized by electrosonic amplitude (ESA) 
with and without surface modification. When PVP polymer modified the 
silica surface, dynamic mobility of silica particles showed a reduction 
from -9 to -7 mobility units (10-8 m?/Vxs). Dynamic mobility of silica par- 
ticles lacking this passivation layer shows that silica suspensions exhibit 
negative surface potentials at pH values above 3.5, and reach a maximum 
potential at pH 9.0. However, beyond pH 9.0, the electrokinetic potential 
decreases with an increasing suspension pH. This effect is attributed to a 
compression of the electrical double layer due to the dissolution of Si ions, 
which resulted in an increase of ionic silicate species in solution and the 
presence of alkali ionic species. When the silica surface was modified by 
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FIGURE 2.6 Electrokinetic behavior of silica suspensions with and without additive. 
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TABLE 2.3 
Removal Rate and WIWNU with and without Modification 
Removal Rate (A/min) WIWNU (%) 

Wafer With Without With Without 
Number Modification Modification Modification Modification 
1 2822 2873 3.57 8.09 
2 2705 2767 3.75 8.62 
3 2791 2813 3.96 9.43 


PVP polymer, the decrease of electrokinetic potential above pH 9.0 disap- 
peared, resulting in an increase of the stability of particles dispersed in 
the alkaline pH region. 


2.3.2 Improvement of ILD CMP with Modified Silica Slurry 


Table 2.3 shows the removal rate and WIWNU of silica slurry with and 
without surface modification. In comparing the results with and with- 
out the modification, the removal rate is similar, but the final WIWNU of 
modified slurry is better than that of nonmodified slurry. Removal rate 
and final WIWNU results were closely correlated to the surface poten- 
tial, rheological behavior, and large-particle size distribution. In effect, 
the surface modification strongly influenced the suspension stability and, 
hence, the properties of wafer uniformity. The microscratch and remain- 
ing particles on the silicon wafer with and without the surface modified 
slurry are shown in Figure 2.7. 

Microscratches and remaining silica particles on a wafer for the modi- 
fied slurry is much lower than those for nonmodified slurry. PVP, which 
modifies the silica particles and plays a preventive role in dissolving Si 
ions, is thought to improve the suspension stability. Due to the surface 


FIGURE 2.7 (See color insert) Analysis of remaining silica particles (particle size > 0.189um) 
on silicon wafers after post CMP cleaning: (left) modified slurry, (right) nonmodified 
slurry. 
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modification, microscratches on the silicon wafer were decreased, as the 
improved suspension stability prevented the undesirable agglomeration. 
Additionally, as the reactivity of Si ion with the silicon wafer is much 
higher than that of silica particle, the stuck particles on the wafer surface 
decreased owing to reduction of the amount of Si ion dissolution in the 
case of the slurry modified with PVP. 


2.4 PAD Dependency in ILD CMP 


Lee et al. (2000) have systematically reported nanotopography impacts on 
oxide CMP, however, they quantified the impact using standard deviation 
of film thickness variation, which itself does not include lateral informa- 
tion. To develop an essential understanding of the relationship between 
the nanotopography and the film thickness variation after CMP, a treat- 
ment is needed not only for the amplitude but also for the wavelength 
component. Fukuda et al. (2000) qualitatively showed the pad depen- 
dency with a comparison between the line profiles of the nanotopog- 
raphy of wafers and the film thickness variations after oxide CMP. The 
pad dependency using two types of pad and the removal depth depen- 
dency is investigated with reiterated polishing followed by film thickness 
measurement. The result is examined by means of the spectral analysis 
method. As the comparison between soft pad and hard pad, the standard 
deviations of filtered film thickness variation of group-G are plotted in 
Figure 2.8. The standard deviation for hard pad test is two times as large 
as the one for soft pad test in the comparison for same removal depth. The 
reason for this trend can be understood from the difference of planariza- 
tion length between two kinds of pad. That is, by the harder pad, more 
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FIGURE 2.8 Standard deviations of the filtered film thickness variation before and after 
CMP. The pad dependency and removal depth dependency is plotted. 
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FIGURE 2.9 (a) A cross-sectional scheme for progressing polishing of oxide film. The sur- 
face of oxide film is gradually smoothed. (b) The transition of film thickness variation dur- 
ing polishing. 


longer-wavelength components of surface waviness can be smoothed 
efficiently, so the planarization of the surface occurs more quickly with a 
smaller removal depth) than by the softer pad. Then the impact of nano- 
topography is more severe if compared for the same removal depth. 

In Figure 2.8 the removal depth dependency is also shown. The increas- 
ing of the filtered film thickness variation with larger removal depth can 
be understood from Figure 2.9, where the oxide surface is gradually pla- 
narized during polishing. 

Correlation between the standard deviation of nanotopography profile 
and the film thickness variation before/after CMP are shown in Figure 2.10 
(for soft pad) and Figure 2.11 (for hard pad). It is reasonable that film 
thickness variations before CMP were independent of nanotopography. 
However, after CMP, the film thickness variation and nanotopography 
have positive correlation. Here two facts are pointed out. 


1. Pad dependency: The slope in Figure 2.11 is steeper than in 
Figure 2.10 if compared for same removal depth. 


2. Removal depth dependency: The slope gets steeper as the removal 
depth increases. 


In other words, the nanotopography impacts on film thickness variation 
get more severe with the larger removal depth. These facts are consistent 
with the result shown in Figure 2.8 and it can be estimated how much the 
nanotopography depends on the film thickness variation. 

The power spectral densities (PSDs) of nanotopography and film thick- 
ness variation are drawn in Figure 2.12. The PSDs of film thickness varia- 
tion before CMP were much smaller than that on nanotopography with 
a factor of 100 or 1000 for all groups of wafer type. The PSD of film thick- 
ness variation is getting closer to that of nanotopography as the polish- 
ing goes on. This is consistent with the model in Figure 2.9. This trend 
is more remarkable for shorter wavelength region and it corresponds to 
descending transfer function mentioned in Section 5.3.2. As for the pad 
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FIGURE 2.10 Correlation between standard deviations of nanotopography and film thick- 
ness variation for soft pad test. 


140 


9 Rmv.depth: OA 
120 O Rmv.depth: 1507A 


< f A Rmv.depth: 3014A 
E 100 E O Rmv.depth: 4461A 
= F y = 0.3706x + 20.936 
L Pee, 
3 so) AOS A Y = 0.2966x + 8.8307 
$ t R? = 0.735 
X | A 
2 60 
vo L 
S Il 38 O Y= 01829x + 11.149 
= [ R? = 0.7294 
E 40H 2 
E Ff AG 
E | E 
20 ER e 


y = -0.0012x + 3.2458 


a R? = 0.00: 
0 1 1 1 1 1 1 i ni ni 1 1 1 1 1 


0 100 20 300 400 
Nanotopography Profile 1 Sigma (A) 


FIGURE 2.11 Correlation between standard deviations of nanotopography and film thick- 
ness variation for hard pad test. 


dependency, with comparison for same removal depth, the PSD for hard 
pad is larger than that for soft pad in long wavelength. For some cases of 
large removal depth the PSD of film thickness variation exceeds that of 
nanotopography in short wavelength. It can be attributed to non-unifor- 
mity, which comes from other than nanotopography (e.g., pad non-unifor- 
mity), however it is not dominant. 
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2.5 ILD Pattern Dependencies 
2.5.1 CMP Tool Dependency 


CMP remains hampered by systematic and random interlayered dielectric 
(ILD) thickness variation at the wafer and die level. Pattern dependencies 
within the die, in particular, have been of concern for both manufactur- 
ability and product design. 

Since the wafer- and die-level sources are deeply confounded, it is 
difficult to characterize the tool dependencies until these sources are 
decomposed. After application of variation decomposition techniques, 
the die-level variation can be analyzed for its pattern dependencies. 
Divecha et al. (1996) have shown that for similar polishing pad and pro- 
cessing conditions (e.g., platen speed, back/head pressure, and spindle 
speed) between the two tools, the die-level variation is similar and is 
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FIGURE 2.13 Capacitance test structures. 


fundamentally dependent upon the underlying topographies, whereas a 
substantial wafer-level variation results. 

Statistical metrology is a methodology for the systematic assessment 
and quantification of the sources of variation in a given semiconductor 
manufacturing process. The methodology requires a large number of 
measurements for statistical modeling. It also emphasizes the design of 
experiments to develop electrical test structures, use of short flow pro- 
cesses to ensure minimum variation in the final parameter from the con- 
founding interactions between processing steps, and close coupling to 
technology computer-aided design (TCAD) tools necessary for extracting 
the desired parameters from electrical measurements. The test structure 
used in this experiment is a metal-to-metal capacitor to infer the ILD thick- 
ness as shown in Figure 2.13. The capacitor test structure has a uniform 
top electrode and bottom electrode consisting of various combinations of 
layout factors such as line width and spacing, finger length, the number 
of fingers, geometric orientation, and presence or absence of an interaction 
ring around the structure. 

Combinations of six layout factors form a half-fractional factorial exper- 
iment yielding 32 unique structures. Four structures are put together in 
a subdie layout shown in Figure 2.14 with corresponding resistive struc- 
tures to account for local line width variation. The subdie layout is rep- 
licated four times within the die to obtain spatial mapping. Figure 2.15 
shows the 1.45 cm x 1.45 cm short-loop test die. The fourth quarter of the 
die includes large uniform density intensive structures to study the area 
dependence, and also serve as dense patterns in the die. Test wafers were 
processed in a short-flow Metall-Metal2 process, with half of the wafers 
being polished on commercial tool A and the other half on commercial 
tool B. Slightly different slurries and pad conditioning techniques were 
used, but the polishing pads, back/head pressure, platen speed, and spin- 
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FIGURE 2.14 Probe layout. 


FIGURE 2.15 CMP/ILD thickness die layout. 


dle speed were similar on the tools. The ILD thickness data were extracted 
from AC high frequency (100 kHz) capacitance measurements. 

The ILD thickness variation sources can be categorized into wafer-level, 
die-level, die and wafer interaction, and residual terms. The wafer-level 
variation is often caused by process perturbation and drifts in equipment 
and consumables, and is relatively invariant of pattern density and other 
layout effects. On the other hand, the die-level variation is attributed to 
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FIGURE 2.16 (See color insert) Wafer level variation for tool A. 


the layout patterns within a die. Statistical methods were used to sepa- 
rate these two components. Figure 2.16 and Figure 2.17 depict the wafer- 
level variation extracted from a typical wafer polished on tool A and tool 
B, respectively. The effect of wafer edge and flats can be discerned from 
these figures. Both tools exhibit substantial wafer-level (or interdie) varia- 
tion. These results are quite striking given that all the wafers from one 
lot were fabricated using the same deposition process. Clearly, the equip- 
ment factors, different pad conditioning techniques, slurry distribution, 
and other macroscopic physical effects during polishing contribute to this 
wafer-level non-uniformity. 

Figure 2.18 and Figure 2.19 show the die-level variation held common 
between all dies on the wafer. The die-level (or intradie) ILD thickness 
variation pattern is found to be nearly identical. Both tools exhibit a simi- 
lar pattern “signature” at the die level with the primary difference being 
relative attenuation in the magnitude of the variation. We attribute most 
of this attenuation to a difference in the total thickness of oxide removed. 
The denser structures on the fourth quarter of the die have less variation 
and are polished slower compared to the rest of the features. These results 
indicate that feature/pattern scale variation appears to be largely deter- 
mined by pad and layout characteristics and are only weakly impacted 
by process conditions. Figure 2.20 shows a quantitative comparison of 
extracted die-level variation components for tool A versus tool B. The 
correlation coefficient for tool A versus tool B is 97%, and the magnitude 
of the slope indicates the difference in attenuation of the variation. A 
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FIGURE 2.18 (See color insert) Die variation for tool A. 


qq-norm plot also revealed that the residuals from the linear fit shown 
in Figure 2.20 are normally distributed. The explicit decomposition and 
modeling of wafer- and die-level variation, especially across different 
tools and consumable sets, can be extremely useful as part of a program 
to reduce pattern-sensitive effects in CMP. 
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FIGURE 2.19 (See color insert) Die variation for tool B. 
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FIGURE 2.20 Die variation for tools A and B. 


2.5.2 Pattern Density Dependency 


Particularly, CMP characteristics are strongly dependent on the pattern 
density of the chip layout. Although CMP can planarize over long-length 
scales, the pattern density variation across a chip leads to large variation 
in global thickness across the die. The initial difference of layout pattern 
density between two regions creates a global step height across the two 
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regions due to the difference in removal rates before the local patterns 
are planarized. The global thickness variation also impacts circuit perfor- 
mance: the long-range clock wires passing through the regions of differ- 
ent thickness result in different capacitances and may result in clock skew. 
Detailed understanding of the polishing process and subsequent improve- 
ments with better consumables and the process control, which minimize 
the pattern-density-dependent variation across a chip, must be employed. 
It has thus become a common practice to use dummy fill structures across 
a chip to minimize the pattern density variation. The dummy fill refers to 
the introduction of extra metal lines or blocks along with the actual metal 
interconnection. The main purpose of using the extra metal is to reduce 
the pattern-dependent ILD thickness variation across a chip by reduc- 
ing the pattern density variation. Introduction of a dummy pattern may 
increase the circuit capacitance, thus it is important to minimize the using 
of the dummy pattern. In order to know how to include dummy patterns 
efficiently, preliminary polishing experiments should be done with spe- 
cially designed wafers having various pattern densities and shapes. 

Kang etal. (2001) investigated the characterizations of pattern-dependen- 
cies in ILD CMP using 8-inch SKW1 wafers designed by SKW Associates. 
The SKW1 density mask has structures with varying local pattern densi- 
ties from 4% (lower-left corner) to 100% (upper-right corner) consisting; of 
25 blocks, as shown in Figure 2.21a. Each block is 4 mm x 4 mm with a 
fixed pitch of 250 um, and the pattern density is increasing gradually at 
increments of 4%. Figure 2.21b shows the image of this layout of the die, 
and Figure 2.21c shows the cross-sectional view of the SKW1 dielectric 
CMP process characterization wafer. 

The process conditions for this experiment are listed in Table 2.4. An 
IC1000/Suba IV K-Grooved stacked pad from Rodel Co. and an SS25 slurry 
from Cabot Co. were used. The thicknesses of prepolished and polished 
wafers were measured across the wafer by an Opti-Probe™ 2600DUV 
from Therma-Wave. To obtain within-die non-uniformity (WIDNU), 25 
measurements per die were done over the metal (not between metal lines) 
near the center of each density block as shown in Figure 2.22. Five dies 
a wafer were measured to compare WIDNU at different positions on a 
wafer. To obtain within-wafer non-uniformity (WIWNU), 30 dies across 
the wafer measuring one site per die on the same density block of 52% 
were measured. In this case, not all 25 sites per die are necessary. Only one 
measurement per die is required on the same density block. The sampling 
scheme of these thickness measurements is depicted in Figure 2.23. 

Figure 2.24 shows the removal rate variation as a function of pattern 
density for five dies with the positions shown in Figure 2.23a. This trend 
indicates that the removal rate decreases linearly as the pattern density 
increases. It can intuitively be explained by Preston’s equation, R = k,pv, 
where k, is the Preston coefficient, R is the material removal rate, p is pres- 
sure, and v is relative velocity. As pattern density increases, the effective 
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FIGURE 2.21 SKW1 density mask structure and layout. 
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TABLE 2.4 
Process Experiment Conditions 
Down Table Spindle Back Slurry 
Number Force Speed Speed Pressure Flow Rate Time 
1 7 psi 30rpm  50rpm 0 psi 200 ml/min 246 sec 
2 7 psi 30rpm  50rpm 0 psi 200 ml/min 295 sec 
Measurement , Not here 
Site a 
25 nm 
0.75 um 
25 nm 
08 pm 


Si Wafer 


Cross Sectional View 


(a) 


FIGURE 2.23 Thickness measurement sampling scheme: (a) five die measurements for die- 
level variation and (b) thirty die measurements for WIWNU. 


contact area between the pad and wafer increases, and then the effec- 
tive local pressure p becomes lower, resulting in a reduced removal rate. 
However, since process time in this experiment is longer than the time 
needed to completely remove step heights, the removal rates in Figure 2.24 
are not directly given to real polishing rates for the pattern densities. 
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FIGURE 2.24 Removal rate variation as a function of pattern density. 
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FIGURE 2.24 (continued) 


Figure 2.25a shows the removal rates for five dies located at different 
positions in a wafer. It represents that the removal rate of the center of 
the wafer is relatively lower than that of the edge. This result cannot be 
explained simply because this kind of non-uniformity can be caused 
by a lot of factors such as machine characteristics, machine conditions, 
consumables, and wafer shape. The WIDNU is about 20% as shown in 
Figure 2.25b, which is inevitable because there are various pattern densi- 
ties in a die for the practice of pattern density effect. Therefore, we can 
see that the pattern density in a die must be kept uniform in real CMP 
process. 

The average WIWNU for SKW1 wafers is 6.4% as shown in Figure 2.26. 
Intuitively, the more the removal thickness, the better planarity achieved; 
however, at the same time, the across-wafer final thickness non-unifor- 
mity becomes worse. 

To predict the final oxide thickness in a pattern wafer, we used a model 
known as the MIT model proposed by Stine et al. (1997). Figure 2.27 
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FIGURE 2.25 (a) Removal rates and (b) WIDNU for five die measurements. 
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defines the terms used in the model. In this model, the pattern density is 
considered as a function of time because the deposited film has a different 
surface area with the polishing time. 

Preston's equation can be reformulated as: 


p(x, y,z) (2.1) 


where K is the blanket-polishing rate and p(x, y, z) is the effective pattern 
density. The equation is then solved for the oxide thickness z under the 
assumption that no “down area” polishing occurs until the local step, z,, 
has been removed, after which the pattern factor is turned off. This is 
given by expressing the effective density as follows : 
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FIGURE 2.26 WIWNU for SKW1 wafers. 
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FIGURE 2.27 Definition of terms used in the basic model. 


Po(x,y,Z) Z>Z =Z 


pan] (2.2) 


1 Z<Zy)—-Z, 


In the determination of the effective density, p(x, y, 2), the effect of lat- 
eral deposition is accounted for by adding a bias term to the metal lines, 
which constitute the mask layout pattern. This ensures that the effective 
density is that of the final film profile and not the initial mask layout. It is 
assumed that the local pattern density is independent of the film thickness 
before the local planarity approximates the actual deposition profiles with 
a vertical profile. In reality, the effective density of the exposed surface 
depends on the height; it is possible to “time step” the profile evolution to 
account for such a time-varying density, but such detail is not essential for 
the prediction of final oxide thickness. The assumption makes it possible 
to express the final film thickness for any time, t, in a closed form as: 


(x,y,z)  Z>Z —Z 
asa d R (2.3) 


1 Z<Zo— Z 
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Before the local planarity is achieved (i.e., while the local step height still 
exists), the final film thickness is inversely proportional to the effective 
local density. The film is assumed to be polished linearly at the blanket 
rate afterward. Based on this closed form of Equation 2.3, the residual 
oxide film thickness was calculated. Our simulation results agree well 
with the experimental data as shown in Figure 2.28. As a result, we can 
conclude that this model is very useful to correctly predict the remained 
thickness after CMP polishing. 

The effects of the pattern density on CMP characteristics using 8-inch 
SKW1 wafers from SKW Associates, which were specially designed for the 
characterization of pattern dependencies in ILD CMP, were investigated. 
The removal rates for various pattern densities and uniformities were 
evaluated and analyzed after CMP. The experimental result shows that the 
removal rate decreases linearly as the pattern density increases and these 
different removal rates for pattern densities cause bad WIDNU. It shows 
that a dummy pattern must be employed to minimize pattern density vari- 
ation. However, the introduction of a dummy pattern may increase circuit 
capacitance, thus it is important to minimize the addition of dummy pat- 
terns. Therefore, to limit the removal rates across a die within reasonable 
values, we must determine what range of the pattern density is available 
in the die at the target residual thickness. Using a simple model that can 
take pattern density into consideration, the remaining oxide thickness 
was calculated and compared with the experimental data. 
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FIGURE 2.28 Experimental and calculated residual oxide thickness. 
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Shallow Trench Isolation CMP 


3.1 Requirement for High Selectivity Slurry 


Shallow trench isolation (STI) is a relatively new technique that is replac- 
ing local oxidation of silicon (LOCOS) for the manufacture of 64 MB semi- 
conductor devices with a linewidth below 0.25 um. Figure 3.1 shows the 
STI CMP process. 

The STI process is defined as: (1) making; a shallow trench to isolate 
active device regions physically, (2) depositing silicon nitride (Si,;N,) on 
oxide films as a stopping layer, and (3) depositing oxide films on the 
trench. Generally, the STI process has the relative capability, compared to 
the LOCOS process, not only to deposit dielectrics to fill trenches isolat- 
ing the active region at low temperature but also to prevent bird’s beak 
and dimensional limitations. That is, STI CMP is essential for separation 
between transistors. If examining STI CMP’s road map (Table 3.1) from this 
vantage point, the removal rate of oxide needs high selective slurry such 
that the nitride removal rate must increase and the number of scratches 
must decrease rapidly. 

The key ingredients to a successful STI process are the achievement of 
well-dispersed abrasive ceramic particles having high oxide-to-nitride 
selectivity and producing few microscratches on the wafer. Silica slurry had 
been conventionally used in the STI CMP process, however, ceria (CeO,) 
slurry with high oxide-to-nitride selectivity has been introduced as the 
thickness of silicon nitride film is decreased by design rule restrictions. 

STI CMP processes with conventional oxide polishing slurries require 
either a reactive ion etching (RIE), etch back preplanarization step, or very 
tight control of the CMP process. Compared with other abrasive slurries, 
ceria slurry has a good selectivity between silicon oxide and silicon nitride. 
There are both chemical and mechanical interactions between ceria par- 
ticles and wafer film during polishing. Nitride film is mainly affected by 
the chemical factors and the nature property of CeO, abrasive particle. 
Ceria slurries offer improved oxide-to-nitride selectivity for planarizing 
the trench fill material while utilizing the nitride film as the polishing 
stop layer. The oxide-to-nitride selectivity is a very important factor in 
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FIGURE 3.1 (See color insert) STI CMP process. 
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TABLE 3.1 
Roadmap for STI CMP 
Year of production 2006 2007 2008 2009 2010 2011 
DRAM % pitch (nm) 70 65 57 50 45 40 
MPU % pitch (nm) 78 68 59 52 45 40 
CMP Performance 
Oxide removal rate [A/min] 3000 + 5000 + 5000 + 5000 + 
500 500 500 500 
Nitride removal rate [A/min] 50 + 10 100~500 100~500 100~500 
+5 +5 +5 
Selectivity (oxide vs. nitride) >35 >10 ~ 50 >10 ~ 50 >10 ~ 50 
Size [nm] 290 265 265 245 
Remaining particle [cm] <0.35 <0.17 <0.17 <0.17 
[# wafer] <238 <116 <120 <115 
Scratch Count [#/wafer, 200 mm] >1 pm 
Oxide dishing [A] 400 400 300 200 
Nitride erosion [A] 50 100 100 100 
Metal contamination [ppma] <1 <1 <0.5 <0.2 


the STI CMP process. It can significantly affect the CMP induced defects, 
such as erosion or dishing, and also be important for endpoint detection. 
Figure 3.2 shows the effect of overpolishing on the characteristics of the 
device. Overpolishing due to nitride erosion or oxide dishing may cause 
degradation of device properties. 
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FIGURE 3.2 (See color insert) The effect of overpolishing on the characteristics of the device: 
(top) the decrease of gate length by overpolishing; (bottom) the shift of threshold voltage in 
the device. 


If the effective gate length (Le) decreases, subthreshold drain current 
(Ip) may increase as shown by: 


I, =p 


C; i -Bvp | „Bbs 1/2 
eh dd je (BY) (3.1) 


and the threshold voltage (Vn) shifts as shown by: 


2y, 2Yp 
1+ 1 14 2-1 
| r | E | (3.2) 


In general, two mechanisms can be applied to improve the selectivity 
between plasma-enhanced tetraethylorthosilicate (PETEOS) and Si¿N, 
during polishing the pattern wafer. One is chemical control using a sur- 
factant to reduce the removal rate of SiN, and the other is mechanical 
control improving the physical properties of ceria particles to enhance the 
removal rate of PETEOS. 
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3.2 Particle Engineering of Ceria Nanoparticles 
and Their Influence on CMP Performance 


The ceria particle is considered to be one of the best glass/SiO, polishing 
abrasives. This is suggested to be due to the reaction between ceria and 
SiO, film, which results in the formation of a chemical “tooth” between 
the silica surface and the ceria particles, and induced localized strain in 
the glass with particle movement. As a consequence, the Si-O-Ce bonds 
can be rapidly removed by the mechanical force generated by a pressed 
pad and abrasive particles. This physicochemical reaction leads to the high 
removal rate of a SiO, film by ceria particles. The physicochemical proper- 
ties of ceria particles, such as crystallinity, particle roughness, and mor- 
phology, depend on the synthesis methods of cerium oxide. In this section, 
the influence of the ceria particles synthesis method on PETEOS and chem- 
ical vapor deposition (CVD) nitride films removal rates are presented. 


3.2.1 Physical Properties of Ceria Particles 


As the solid-state displacement reaction method and wet chemical pre- 
cipitation method were employed for synthesizing ceria powders, the 
characteristics of ceria properties showed different features in several 
experiments. Figure 3.3 shows the morphology of ceria particles observed 
by a high-resolution scanning electron microscope (SEM; S900, Hitachi, 
Japan) and transmission electron microscope (TEM; JEM-2010, JEOL, 
Japan). In these figures, the ceria particles have a polyhedral shape. Both of 
the powders have nearly the same size. The primary particle size is =40 nm. 
However, the difference in crystal shape of the ceria particles was found 
in TEM analysis. Figure 3.4 shows x-ray diffraction (XRD) profiles of ceria 
powders produced by precipitation. The XRD data of the synthesized par- 
ticles shows characteristics of CeO, with a typical fluorite structure. Since 
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FIGURE 3.3 SEM and TEM micrographs of CeO, particles: (a) SEM (100,000 magnification); 
(b) TEM (300,000 magnification). 
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FIGURE 3.4 X-ray diffraction pattern of CeO, powders synthesized by precipitation 
method. 


the starting cerium salt was Ce(NO,) , it required oxidation of Ce** to Ce** 
in the solution. In this system, there is a possible cause for this oxidation. 
According to the Lewis definition of acids and bases, Ce** is a Lewis base 
and Ce* is a Lewis acid. Basic solution, therefore, favors Ce** compared to 
Ce**. The crystallite size was calculated from the Scherrer formula 


D =0.9 / (B cos) (3.3) 


where A is the wavelength of the x-rays, 0 is the diffraction angle, and f is 
the half-width. 

Average crystallite size of CeO, calculated by the Scherrer equation from 
the XRD line broadening was 46 nm for powder A and 34 nm for powder 
B. The crystallite size increases as the calcined temperature increases. 


3.2.2 STI CMP Performance with Ceria Slurries 


Figure 3.5a shows the result of CMP field evaluation. Average PETEOS 
removal rate of slurry A was 2883A/min and B was 672A/min. The within- 
wafer non-uniformity (WIWNU) shows that ceria slurry B (0.7%) is better 
than ceria slurry A (1.9%). Average nitride removal rate of slurry A was 
51A/min and B was 44A/min as shown in Figure 3.5b. Thus, oxide-to- 
nitride selectivity was 56 for ceria slurry A and 15 for ceria slurry B. CMP 
field evaluation of ceria slurries having different crystallinity showed 
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FIGURE 3.5 The result of CMP field evaluation: (a) profiles of PETEOS removal rate; 
(b) profiles of nitride removal rate. 


that slurry A had better crystallinity and smaller pore size and exhibited 
a higher removal rate of PETEOS than B. Ceria slurry A showed higher 
removal rate and better planarization than slurry B. The oxide removal 
rate can be influenced by two CMP processing parameters: mechanical 
grinding and chemical interaction. These mechanisms play simultaneous 
roles in polishing. Concerning the chemical interaction between PETEOS 
and ceria slurry, Si-O-Ce bonding on surface is a dominant mechanism. 
During polishing of the PETEOS film, the SiO, surface would first react 
with CeO, particles and a multiple number of chemical bonding Si-O-Ce 
is formed on the surface. 

Then mechanical tearing of Si-O-Si bonds leads to removal of SiO, or 
Si(OH), as monomer lumps, then the lumps are released from the CeO, 
particles downstream. Highly crystallized ceria particles have a great ten- 
dency to form a bonding between Ce and Si, increasing the oxide removal 
rate. Ceria particles in B are unlikely to have less completed oxidation 
during the wet chemical precipitation and have less hard and less rigid 
surfaces. Therefore, B grains might have little effect in physical polishing 
with PETEOS film. Ceria particles in slurry A has almost fully crystallized 
on the surface after calcination at 800°C. CeO, surface of A would supply 
more potential site to react and bond between Ce and Si than that of B. 
Therefore, slurry A would interact with the oxide layer much more easily 
and readily and hence slurry A has a higher removal rate of PETEOS. 


3.2.3 Influence of Crystalline Structure of Ceria 
Particles on the Remaining Particles 


Figure 3.6 shows the morphology and primary particle size of the ceria 
particles after mechanical milling. The particle size distribution was found 
to be noticeably different between samples based on the TEM images. 
The portion of small-sized particles (<20 nm) in sample B and C were 
larger than sample A. The initial crystallite size of all as-calcined samples 
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FIGURE 3.6 (See color insert) TEM images and nano beam diffraction. 


calculated by the Debye-Scherrer equation was about 26 nm. However, 
the crystallite sizes of the particles after mechanical milling became 25, 
21.5, and 20.3 nm for samples A, B, and C, respectively, which leads to a 
significantly different specific surface area for each particle group. After 
mechanical milling, the particles in all the samples had a similar size 
of 240-260 nm. Therefore, the particles of samples B and C, which were 
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calcined at low oxygen concentration, can be considered to be composed 
of a large number of small crystallite (20.3 and 21.5 nm), while the particle 
of sample A, which was calcined at a higher oxygen concentration, is com- 
posed of relatively larger crystallite (25 nm). 

Using the nano beam diffraction pattern of the high-resolution TEM, 
the crystalline structure of the ceria particles was investigated. The nano 
beam diffraction patterns of the particles (Figure 3.7) indicate that for 
sample A all particles had the cubic fluorite phase of cerium oxide, while 
for samples B and C, which were calcined at low oxygen concentration, 
hexagonally structured particles were included, especially in the smaller 
particles. The calcination process from cerium carbonate to cerium oxide 
consists of a five-step mechanism, including the mass transfer of the react- 
ing agent (oxygen) from the bulk atmosphere to the periphery of carbon- 
ate, its diffusion through the pore channels of the carbonate, adsorption, 
reaction with the cerium carbonate, and the desorption of the reaction 
by-product (carbon dioxide). In this reaction mechanism, several factors 
influence the physical properties of the synthesized particles during the 
calcination process. According to previous reports, a low oxygen concen- 
tration results in a hexagonal phase cerium oxide rather than the cubic 
phase due to the insufficient oxidation of Ce** to Ce**. 

In the absence of external oxygen supply, the hexagonal CeO, phase 
is reported to be observed between 400*C and 500*C and the oxidation 
of CeO, is completed above 800*C, which results in the transformation 
from hexagonal Ce,O; to cubic CeO,. Therefore, it can be considered that 
the hexagonal phase (Ce,O;) in samples B and C was formed due to the 
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FIGURE 3.7 Large particle counts by Accusizer II. 
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insufficient oxidation of Ce** to Ce**, whereas the cubic phase (CeO,) in 
the sample A was obtained by sufficient oxygen supply. 

To investigate agglomeration in the slurries, we measured the number 
of large particles of over 1 um per unit volume by using a slurry particle 
counter; the results are shown in Figure 3.8. Samples B and C with a hex- 
agonal crystalline structure have more large agglomerated particles than 
sample A. Since the Ce** and oxygen vacancies on the surface of the hex- 
agonal ceria particle are linked to anionic vacancies with hydroxyl groups, 
chemisorption and agglomeration easily occurred with the neighboring 
cation-species and bared ceria surface. Moreover, samples B and C have 
too many small particles, which were confirmed to be mainly hexagonal 
crystalline structure (Figure 3.7). The van der Waals attractive force is well 
known to be increased with decreasing particle size; therefore, it can be 
considered that large agglomerated particles in sample B and C are attrib- 
uted to the presence of small-sized particles. 

CMP was performed with the ceria slurry, which was prepared by add- 
ing an adequate amount of commercially available anionic acrylic poly- 
mers (PMAA). Cubic CeO, contained (sample A) and CeO, contained with 
included hexagonal Ce,O; particles (sample C) were used as abrasive par- 
ticles. The removal rate trends along the radius and WIWNU of the oxide 
film are shown in Figure 3.8. In Figure 3.9, the removal rate of oxide film 
shows a remarkable difference between samples A and C. The removal 
rate of the oxide film was dependent on the crystallinity of the ceria par- 
ticle. The polishing of the oxide film was mainly affected by its chemical 
interaction between the ceria particles and the oxide (SiO,) film, which 
were reacted with the hydrated surface to form covalent bonds such as 
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FIGURE 3.8 Removal rate trends. 
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FIGURE 3.9 Within wafer non-uniformity. 


Ce-O-Si, and then pulled off the oxide lumps. As shown in Figure 3.6, 
the particles of sample A have a larger crystallinity and narrower par- 
ticle-size distribution than the particles of sample C. Since the hexago- 
nal structured ceria particles and the agglomerated particles of sample C 
were easily broken apart during the CMP process, these particles do not 
penetrate the viscous layer on the oxide film. Thus, the removal rate of 
sample C was low, resulting in poor oxide-to-nitride selectivity. On the 
other hand, the surface of the silicon nitride film during polishing is pas- 
sivated with an adsorptive surfactant in the slurry, which prevents the 
abrasive from directly contacting the film surface. Hence, the removal 
rates for the SizN, film were not influenced by the crystalline structure of 
the ceria particle. In addition, the slurry with agglomerated particles was 
hardly propagated over the whole wafer surface due to the poor stability. 
Thus, as shown in Figure 3.9, sample C has a higher WIWNU and a lower 
removal rate of wafer edge position than sample A. 

During the CMP, the agglomerated particles were easily stuck to the 
wafer surface by the small interactive force between the abrasive and 
oxide film. It is these sticking particles in particular that induce the surface 
scratches on the wafer due to the compressive and shear forces between the 
wafer and pad. Therefore, agglomerated particles are a major cause for the 
residual particles and the microscratches in the CMP process. The maps of 
the residual particle counts and scratch counts are shown in Figure 3.10a 
and b, respectively. The smaller particles, which were observed in sam- 
ple C, had a high surface activity and specific surface area. As shown in 
Figure 3.10a, the residual particle counts of sample C are much larger than 
that of sample A. These residual particles induced the surface scratching 
during the CMP process. In Figure 3.10b, the scratch counts of sample C are 
also larger than sample A. Consequently, the ceria powders that include 
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FIGURE 3.10 Remaining particle maps on the oxide film measured with the Sufscan 7700. 


hexagonal structured particles were easily broken down to the smaller 
particles and induced particle adhesion on the wafer surface. Therefore, 
the ceria particles should be calcined to a cubic structure through control 
of the oxygen concentration. 


a Se 


3.3 Chemical Engineering for High Selectivity in STI CMP 


STI CMP performance depends on several factors such as deposited nitride 
thickness, trench depth, thickness and type of deposited trench-fill oxide, 
removal rate consistency, the physicochemical nature of CeO, particles, 
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organic additives, and suspension pH. Recently, many scientists have inves- 
tigated the polishing mechanism of SiO, film and the physical properties 
of ceria particles. However, little is known about the removal rate and the 
selectivity between SiO, and SiN, CVD films. There are both chemical and 
mechanical interactions between ceria particles and the deposited film 
and the organic additives. The polishing of the oxide and nitride films is 
mainly affected by the nature of the CeO, abrasive particles and by some 
other chemical factors. The removal selectivity is due to the electrokinetic 
behaviors of the ceria particles and the SiO, and SiN, CVD films in aque- 
ous media. The CMP performance can be influenced by differences in the 
surface potentials of ceria particles with poly(methacrylic acid) (PMAA), as 
dispersant, and the PETEOS and CVD Si,N, films. The surface potentials 
are affected by the suspension pH. This results from the different isoelectric 
pH (pHiep) of the ceria particles with PMAA, SiO, and Si;N, CVD films. 


iep 


3.3.1 Electrokinetic Behavior of the Ceria 
Particle, Oxide, and Nitride Films 


The electrokinetic behaviors of ceria, ceria particles with PMAA, and SiO, 
and Si,N, as a function of pH were investigated to identify the polishing 
behavior in STI CMP. These results are shown in Figure 3.10. The electro- 
kinetic behavior of each particle is reflected in the interaction between the 
ceria particles in CMP slurry and the deposited film to be polished. The 
electrophoretic mobility of all components is strongly dependent on the 
suspension pH. The electrophoretic mobility of silica is negative above pH 
3.4, which is the isoelectric point (pH;ep) of silica. However, at a pH above 
9, the electrophoretic mobility decreases with an increasing suspension 
pH. This is attributed to a compression of the electrical double layer due 
to both dissolution of the Si** ion, resulting in an increase of ionic silicate 
species in the solution, and to the presence of some other alkaline ionic 
species. SiN, has a pH;., of 6.5. Above this pH value, particles carry a net 
negative charge because of the formation of SiO”, which results from the 
dissociation of surface silanol groups. The ceria particles have a pH;., of 
about 7, and a slightly positively charged surface below this pH region. 
However, a shift in the pH;,, of the ceria particles toward the acidic pH is 
produced in the presence of PMAA. There are two causes for this behavior. 
First, the ionization of near-surface segments partially screens the charge 
on the particles, thereby decreasing the shear-plane potential. Second, 
the presence of polymer chains (PMAA) may disturb the hydrodynamic 
plane of shear, shifting it farther out from the particle surface. Because the 
potential decreases exponentially with the distance, the modified shear 
plane will experience a lower potential. Above the pH;,,, electrophoretic 
mobility of ceria particles increased with suspension pH up to the satura- 
tion point. Saturation occurs in the pH region of 6-9, resulting in a neg- 


atively charged particle whose electrokinetic behavior is essentially no 
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longer dependent on the suspension pH. Thus, differences in the surface 
potentials of ceria particles with ionizable acrylic polymer, PETEOS, and 
silicon nitride films were found by electrokinetic analysis. The SiO, film 
and abrasive ceria particles are negatively charged above pH 4, but the 
Si¿N, film is positively charged below pH 7. These results cause the selec- 
tive adsorption of acryl-based polymer. Additionally, the results suggest 
that the effect of dispersion stability on the polishing rate depends on the 
surface potentials between ceria particles and deposited films, which are 
influenced by the suspension pH. Ceria particles have negligible charges 
in acidic pH and thus agglomerate as a result of the weak interparticle 
repulsive forces. The low removal rate in this pH region is due to the 
decrease of contact area between ceria particles and the deposited film. 
Thus, the number of total particles participating in CMP decreases due to 
agglomeration. However, ceria particles have a negative charge in neutral 
and alkaline pH regions, where stable dispersion stability enables high 
removal rates of the PETEOS and CVD Si,N, films. 

Figure 3.11 shows the particle size distribution and the agglomerated 
particle behavior of ceria slurries for STI CMP as a function of suspension 
pH. Figure 3.11a shows that the acidic suspension had greater particle size 
and broader size distribution of abrasive ceria than those of the neutral 
or alkaline suspensions. The ceria slurry in acidic suspension was found 
to be unstable with a large agglomeration of particles, having a wide size 
distribution ranging from 0.02 to 100 pm resulting from lower electro- 
static repulsive forces. The surface potential of ceria particles decreases in 
acidic suspensions. Figure 3.11b shows the agglomeration of abrasive ceria 
particles as a function of the suspension pH by in situ optical microscopy. 
A great number of large agglomerates were observed in the acidic slurry. 
The suspension became unstable because of a decrease in electrokinetic 
potential in the acidic pH region. The ceria slurries in neutral and alkaline 
suspensions were found to be well dispersed and large agglomerates were 
not observed. 


3.3.2 STI CMP Performance in Different Suspension pH 


The results of the CMP field evaluation are shown in Figure 3.12, and the 
removal rates of oxide and nitride and the selectivity of slurries are pre- 
sented in Table 3.2. The slurries prepared in neutral and alkaline suspen- 
sions had high removal rates of PETEOS and low WIWNU. 

As shown in Figure 3.10, the polishing rate is dependent on the electro- 
kinetic behaviors among ceria particles and the PETEOS and CVD Si,N, 
films at a given pH. This result was only considered from the interaction 
between the ceria particles. Furthermore, the removal rate is also affected 
by the suppression of SizN, film removal by the additive polymer. In this 
study, poly(acrylic acid) (PAA) was used to suppress the removal of the 
CVD SiN, film. The interactions between the PAA and the ceria particle/ 
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FIGURE 3.11 (a) Particle size distribution of ceria slurries; (b) optical micrographs of ceria 
slurries in suspensions of different pH. 


TABLE 3.2 
Removal Rate and Selectivity in CMP Field Evaluation 
Selectivity 
Oxide R/R Nitride R/R (Oxide 
(Â/min) (Â/min) R/R:Nitride R/R) 

Acidic slurry 156.6 85.8 1.8:1 
Neutral slurry 3098.9 69.7 44.5:1 
Alkaline slurry 3718.7 731 5:1 


deposited films were controlled by suspension pH. That is, coiled confor- 
mation of PAA at pH 3 permits a denser spacing on the PETEOS and CVD 
SiN, films, which results in a decrease in the removal of PETEOS and sili- 
con nitride film. The PA A is adsorbed on both the PETEOS and CVD Si,N, 
films, which suppresses the deposited film's removal due to formation of 
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FIGURE 3.12 The result of CMP field evaluation: (a) average PETEOS removal rate and (b) 
average nitride removal rate. 


a polymer-coated layer. As shown in Figure 3.11, dispersion stability of 
acidic slurry was found to be unstable, which resulted in a low PETEOS 
removal rate as well as worse non-uniformity in the CMP field evaluation. 
During CMP at pH 6, the ceria particles with PMAA, as dispersant, have a 
negative charge, and the PETEOS and CVD SiN, films have negative and 
positive charges, respectively. The PAA has negatively charged functional 
groups. The CVD Si,N, film will have a higher affinity to this polymer 
than the PETEOS film would, that is, the PAA adsorbed only onto silicon 
nitride film with the formation of a polymer-coated layer. This was attrib- 
uted to the selective adsorption due to the charge difference between the 
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PETEOS and CVD Si,N, films in the neutral pH region. The suppression 
of silicon nitride film removal occurred due to the adsorption of PAA. 
Thus, high removal rates of PETEOS and oxide-to-nitride selectivity were 
obtained at pH 6. At pH 9, all materials have negative surface charges, 
resulting in a net negative charge, causing repulsion between the PAA 
and the surface of PETEOS and CVD Si¿N, films. Consequently, there is 
no polymer-coated layer formed on PETEOS and CVD SiN, films in an 
alkaline environment. The removal rate of the PETEOS and the CVD Si,N, 
films both showed higher values at pH 9. In addition, Si** ions on PETEOS 
film dissolved at pH 9, which caused the PETEOS film to become softer. 
Therefore, the highest removal rate of the PETEOS was obtained in the 
alkaline pH region. Based on the results, high removal rates of PETEOS 
film and selectivity can be obtained by using neutral slurry. 

The removal rate of the STI CMP process depends not only on the elec- 
trokinetic behaviors of abrasive ceria particles and the deposited films, but 
also on the conformation of the adsorbed polymer. The additive polymeric 
chains were found to have an opposite or identical charge with respect to 
the deposited film in certain pH regions. Thus, the selective adsorption of 
polymeric chains onto the surfaces of the CVD Si,N, film in the neutral 
pH region was due to the CVD Si,N, film having higher affinity than the 
PETEOS film for this polymer. Consequently, it is concluded that the con- 
trol of suspension pH, which enables the dispersion stability and selective 
adsorption of polymeric chains, is the technical key for good uniformity 
of the deposited film surface and for high selectivity of STI CMP. 


3.3.3 The Conformation of Polymeric Molecules 
and STI CMP Performance 


Aqueous titration was carried out to observe the ionization behavior and 
conformational change of PAA as a function of pH. Figure 3.13 shows 
the degree of ionization, a, at different ionic strengths, where a = COO-/ 
(COO- + COOR). As the ionic strength increased from 0 to 0.4 M, titra- 
tion curves for PAA were slightly shifted to the left and the degrees of 
ionization of PAA increased from 0.743 to 0.825 at pH 6.5. The addition of 
KNO; provides counterions to stabilize the negative charges of carbox- 
ylate groups along the backbone of PAA, which subsequently shifts the 
equilibrium to the left. The counterions also form an ionic atmosphere 
around the negatively charged carboxylate groups of PAA, which shields 
the Coulombic interactions between COO- and H* and screens the repul- 
sion between the carboxylate groups. The reduced intersegment repulsion 
between screened carboxylate groups leads to a conformational change 
from a stretched to a coiled configuration. 

Figure 3.14 shows the adsorption isotherms for PAA on silicon nitride as 
a function of ionic strength. By increasing the ionic strength from 0 to 0.4 
M, the amount of PAA adsorbed on the silicon nitride surface increased 
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FIGURE 3.13 Dissociation of PAA as a function of pH and ionic strength. 
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FIGURE 3.14 Adsorption isotherms for PAA on SiN surface at pH 6.5. 


from approximately 0.055 to 0.097 mg/m’. The amount of PAA adsorbed 
in the presence of KNO, was increased by the charge neutralization of 
PAA due to the screening of the electrostatic attraction between COO- 
and H*. The dissolved potassium ions can form an ionic atmosphere 
around the negatively charged carboxylate groups of PAA, and it screens 
the electrostatic repulsive forces between adjacent carboxylate groups. 
This screening results in the charge neutralization of negatively charged 
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carboxylate groups. Thus the conformation of PAA was known to change 
from a stretched to a coiled structure by charge neutralization. The con- 
formation of the polymeric molecules affected the adsorption behavior 
of polymeric molecules on the solid phase, which is primarily affected 
by the suspension pH, the molar mass of polymeric molecules, and the 
number density and valence of the counterions. A low ionic strength per- 
mits a stronger interaction with the aqueous phase and reduces the driv- 
ing force for adsorption onto the SizN, due to the electrostatic repulsion 
between neighboring carboxylate groups in the PAA chains. This results 
in a reduced uptake and a more extended adsorption conformation with 
fewer attachment points on the SizN, surface. In contrast, a high ionic 
strength results in a stronger, more compact interaction with the SiN, 
surface through nonelectrostatic interactions between the SizN, surface 
and PAA chains. 

Figure 3.15 represents the effect of ionic strength on the intersegment 
forces between the adsorbed PAA on SizN, film and SiN, AFM tip. The 
strong repulsive force of the PAA solution in the absence of KNO, orig- 
inates at a separation of 30 nm. This was attributed to the electrostatic 
repulsion between the negatively charged carboxylate groups of PAA at 
a pH 6.5. The separation distance was decreased by increasing concen- 
trations of KNO;. With the addition of 0.4 M KNO,, the repulsive force 
was observed at the reduced separation distance of 5 nm. This result con- 
firms that charge neutralization for the negatively charged carboxylate 
group of PAA is accelerated when the ionic strength is higher. In particu- 
lar, as can be seen in the logarithmic plot in the inset of Figure 3.15, the 
repulsive forces of the PAA solution with 0 M, 0.2 M, and 0.4 M KNO, 
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FIGURE 3.15 Force-separation profile between the AFM tip and a SiN film with different 
concentration of KNO; at pH 6.5. 
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FIGURE 3.16 Schematic diagram of the adsorption behavior and conformation of PAA on 
SiN film. 


follow the Poisson-Boltzmann equation under the separation distances 
of 4.2, 3.6, and 2.0 nm, respectively. Below these separation distances, the 
repulsive force between the PAA adlayers is dominated by steric hin- 
drance. Therefore, the adsorption thickness of PAA on the nitride film 
can be considered to be decreased from 2.1 to 1.0 nm by increasing the 
concentration of KNO, to 0.4 M. With the increase of ionic strength, the 
thickness of PAA adsorbed on the Si,N, decreased. The thicknesses of the 
PAA adlayer on the Si¿N, surface was 5.2, 4.5, and 3.8 nm for PAA with 0 
M, 0.2 M and 0.4 M KNO,, respectively. Atomic force microscopy (AFM) 
and Nanospec data show a similar trend; the thickness of PAA decreases 
as ionic strength increases. This implies that the coiled conformation of 
PAA forms a dense passivation layer on the Si,N, film. The adsorption 
behavior and conformational change of PAA are depicted schematically 
in Figure 3.16. The trends in removal rates along the whole of the wafer for 
the SiO, and Si,N, layers are shown in Figure 3.17. Since the PAA solution 
with the addition of 0.4 M KNO, is expected to effectively suppress the 
removal of the Si,N, films, a CMP field evaluation was performed using a 
PAA solution without KNO, and with 0.4 M KNO.. The removal rate of the 
SiO, film was 2690 A/min, 2657 A/min and 2610 A/min for PAA with 0 M, 
0.2 M, and 0.4 M KNO,, respectively. As the ionic strength increases, the 
particle is agglomerated by screening the charges on the particle surfaces, 
which results in increasing the removal rate. However, for the purposes 
of the present study, the KNO, was added into the PAA solution, then the 
ceria slurry and the PAA solution are mixed at the point of use (POU). 
Therefore, the removal rate of the SiO, film was affected by, not the ionic 
strength, but the size and crystallinity of the ceria particle. However, the 
PAA solution with 0.4 M KNO, resulted in lower removal rates of the CVD 
Si,N,. The average removal rate of the Si,N, film decreased from 72 A/ 
min to 61 A/min as ionic strength increased up to 0.4 M. The characteris- 
tics of the CMP evaluation are attributed to the conformational change of 
PAA. As ionic strength increases, negatively charged carboxylate groups 
of PAA are neutralized by the presence of potassium ions and thus PAA 
adopts a coiled configuration. This leads to a reduction in the potential 
for the penetration of abrasive particles into the film and decreases the 
friction force between the abrasive particles and the film due to the dense 
passivation layer. As a result, the Si,N, film was successfully passivated by 
using a PAA solution with 0.4 M KNO,, which resulted in a lower removal 
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rate than the PAA solution without KNO;. Consequently, the SiO,-to-SizN, 
selectivity increased from 37:1 to 42:1 as ionic strength increased up to 0.4 
M. It is noteworthy that the successful passivation of SizN, in STI CMP can 
be accomplished by controlling the conformation and adsorption amount 


of PAA. 
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FIGURE 3.17 Removal rate trends along the entire resultant (a) SiO, and (b) SiN film. 
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3.4 Force Measurement Using Atomic Force 
Microscopy for Mechanism 


The force-separation profile between the AFM tip and the PETEOS or 
CVD Si¿N, film was measured to analyze the interaction between PAA 
and the deposited film (Figure 3.18). Force measurement using AFM is 
expected to indicate the direct evaluation of the adsorption behavior of 
the polymeric molecules on the film surface. Figure 3.18a shows the force— 
separation plot of the Si tip with PETEOS film at pH 6.5 for different molec- 
ular weight of PAA. The surface of the oxide film, whose pH, is about 3.4, 
is negatively charged at pH 6.5, and thus a repulsive force occurred due 
to the electrostatic interaction. The interaction range is about 20 nm in 
the absence of PAA. It is of interest that there is no significant difference 
between the surface forces of the tip and the oxide film even with the 
presence of PAA. This result is almost the same for all samples, irrespec- 
tive of molecular weight, which means that PAA is scarcely adsorbed on 
the oxide film due to the electrostatic repulsion between like-charge of 
the film and PAA. This is because the adsorption behavior of partially or 
fully ionized polyelectrolyte such as PAA is dominantly affected by elec- 
trostatic interaction. 

On the other hand, Figure 3.18b illustrates a meaningful change in the 
interaction force between the nitride film and Si¿N, tip with and without 
PAA having the different molecular weight of 5,000, 15,000, and 30,000. In 
the absence of the absorbed PA A molecule, an attractive force was observed 
at approximately 10 nm of separation distance and the maximum attrac- 
tive separation force is about 0.5 nN at 5 nm of separation. However, it 
was found that the attractive force disappeared and a repulsive force was 
shown by the addition of PAA. This result is more clearly observed as the 
molecular weight of PAA increases. In PAA 5,000 solution, it was shown 
that the attractive force drastically decreased, even though a very weak 
attraction still exists. 

Strong repulsive forces were observed for PAA 15,000 and 30,000 solu- 
tions. In the case of PAA 15,000 and 30,000, the repulsion force starts to 
originate at a separation of 30 nm. Especially, a more significant increase 
in the interaction force is observed from about separation of 7 nm in the 
case of PAA 30,000 solution, compared to PAA 15,000 solution. The pH 
6.5, where the experiment was performed, is almost the same as the pH. 
of silicon nitride, and thus net charge forces on the surface of both the 
tip and wafer is nearly zero in the absence of PAA, which results in a 
van der Waals attractive force. However, the presence of PAA in the sys- 
tem leads to the adsorption of PAA on the nitride film and the resulting 
formation of PAA layer, and thus the repulsion between the dense PAA 
layers adsorbed onto the tip and the film is observed. As can be seen in 
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FIGURE 3.18 Force-separation profiles between the AFM tip and a film with different Mw 
of PAA at pH 6.5: (a) oxide, (b) nitride. 


the logarithmic plot in the inset of Figure 3.18b, the repulsive forces of 
PAA 15,000 and PAA 30,000 follows the Poisson—Boltzmann equation up 
for 4 and 7 nm of the separation distance, respectively, which indicates 
that the interaction between PAA layers are of electrostatic origin. Below 
these separation distances, the PAA layer is responsible for the repulsive 
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interaction, that is, the steric contribution. It is suggested that the adsorp- 
tion thickness of PAA on the nitride film is 2 nm for PAA 15,000 and 3.5 
nm for PAA 30,000. Thus, the increase in repulsion force with molecular 
weight of PAA is attributed to the increase of the PAA adsorption layer 
thickness and the formation of a denser adsorption layer. This AFM result 
agrees with Vedula and Spencer's work. They revealed that the maximum 
adsorption amount of PAA is increased as the molecular weight increases 
and the adsorption behavior of PAA on amphoteric surfaces in an aque- 
ous suspension is qualitatively similar for a variety of materials, including 
Si¿N,. 

The different adsorption behavior of PAA on the PETEOS/CVD Si¿N, 
can be further described by the AFM images (Figure 3.19). The morphol- 
ogy and surface roughness of the oxide and nitride film with and without 
PAA 30,000 were observed. In the case of oxide, there is no significant 
change of the film surface between the bare oxide film and the PAA 30,000 
spin-coated film, which means PAA is not adsorbed on the oxide film, as 
shown in Figure 3.19a and Figure 3.19b. On the other hand, it was clearly 
found that the surface roughness of the PAA 30,000 spin-coated nitride 
film (0.469 nm) is much higher, compared to that of the bare nitride film 
(0.156 nm). This is attributed to the formation of the PAA adsorption layer 
on the nitride film due to the electrostatic interaction between PAA and 
the film surface. These results are in agreement with the AFM force mea- 
surement shown in Figure 3.18b. 
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FIGURE 3.19 AFM images of nitride film. 
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Additionally, it is reported that PAA in the CeO, slurry has an influence 
on the dispersion stability of the CeO, particles. Because the suspension 
stability is significantly reflected in the interaction between the CeO, par- 
ticles in the CMP slurry and the deposited film, the relationship between 
the organic additive having the different molecular weight and the disper- 
sion stability of the CeO, particles should be considered. Figure 3.20 illus- 
trates the particle size distribution of CeO, as a function of the molecular 
weight of PAA. It was found that the size distribution of CeO, becomes 
broader and the mean secondary particle size is increased from 37 nm 
to 46 nm as the molecular weight of PAA is increased from 0 to 30,000. 
Also, the volume fraction of the agglomerated particles increases with an 
increase in molecular weight of PAA, indicating that particle agglomera- 
tion occurred in shallow secondary minima with a decreasing of electri- 
cal repulsive forces. Therefore, it is expected that the increase in molecular 
weight of PAA leads to the decrease in the suspension stability. 

The results of the CMP performance as a function of the molecular weight 
of PAA are shown in Figure 3.21. It was found that WIWNU is maintained 
at a low level, regardless of the molecular weight of PAA. This indicates that 
all of CeO, slurries used in this study have reasonably good dispersion sta- 
bility, even though there is a slight difference in the agglomeration particle 
size among them. The removal rates of both the oxide and nitride films are 
decreased as the molecular weight of PAA increases. The removal rate of the 
average oxide film proportionally decreases from 3365 A/min to 2420 A/ 
min as the molecular weight of PAA increases from 0 to 30,000. This result 
is attributed to the relationship between the stability of the CeO, slurry and 
the molecular weight of PAA as shown above. A stable dispersion enables 
a high removal rate of the deposited film due to an increase in the contact 
area between the particles and the film because the number of total particles 
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FIGURE 3.21 Removal rate trends along the entire resultant (a) oxide and (b) nitride film. 


participating in the CMP process increases. However, it is noteworthy that 
the removal rate of the nitride dramatically decreases from 453 A/min to 59 
A/min with increase in molecular weight of PAA from 0 to 30,000, indicat- 
ing that it does not show a linear relationship with the dispersion stability 
of CeO,. 
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From the AFM result shown in Figure 3.18, it can be considered that this 
drastic decrease in the removal rate of the nitride film as a function of the 
molecular weight of PAA is due to the formation of the passivation layer 
due to the strong interaction between PAA and nitride film. Force mea- 
surement using AFM clearly showed that the adsorption behavior of PAA, 
namely, the adsorption layer thickness on the nitride film, is strengthened 
with an increase in molecular weight. In addition, the PAA layer formed 
on the nitride film is scarcely removed, but the PAA layer can be sustained 
under the mechanical stress of the down pressure and spindle force dur- 
ing the STI CMP process. 

This phenomenon can be explained by two mechanisms: (1) the strong 
adhesion occurs due to the electrostatic interaction between PAA and the 
nitride film at pH 6.5 and (2) the adsorbed or nonadsorbed PAA mole- 
cule reduces the mechanical stress on the nitride film, acting as lubricant. 
Although the destruction (delamination) of the PAA layer on the nitride 
film occurs under an applied load, the existence of PAA in the bulk solu- 
tion can be dynamically readsorbed on the nitride surface, which is newly 
exposed after the removal of PAA layer. Thus, it leads to the reduction 
of the possibility of the penetration of the abrasive CeO, particles on the 
wafer and the decrease of the friction force between the abrasive particle 
and nitride. On the other hand, PAA is seldom adsorbed on the oxide 
film, regardless of molecular weight, due to the electrostatic repulsion 
between the like-charge of the PAA and the oxide film surface. Therefore, 
this difference in the removal mechanism of the oxide and nitride films 
gives rise to the dramatic increase in the oxide-to-nitride selectivity val- 
ues from 7:1 to 41:1 as the molecular weight of PAA increases from 0 to 
30,000. Thus, the formation of a passivation layer on the nitride film by the 
adsorption behavior of the polymer plays a dominant role in determining 
the oxide-to-nitride selectivity in STI CMP, and this selectivity is signifi- 
cantly affected by the molecular weight of the polymer. Consequently, it 
was obtained by the force measurement using AFM that the control of the 
molecular weight of PAA is a technical key to obtaining high oxide-to- 
nitride selectivity in STI CMP. 


3.5 Pattern Dependence of High-Selectivity Slurry 


To improve the performance of high-selectivity ceria slurry in STI CMP, it 
is essential to control the slurry properties, including the pH, the concen- 
tration, the molecular weights of the organic additives, and the abrasive 
particle size. The dependencies of the removed amount and the surface 
roughness of SiO, and SizN, films on the molecular weight and the con- 
centration of PAA in ceria slurries containing abrasives with different 
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primary sizes, through STI CMP tests using blank and patterned wafers, 
were investigated. 

Cerium carbonate was used as a precursor to synthesize two types of 
ceria powder. The primary grain size of the polycrystalline ceria abra- 
sives was controlled by employing a calcination process for 4 h with two 
calcination temperatures of 700°C and 800°C. The secondary particle 
size of the abrasives was controlled by crushing the powders by using a 
laboratory-scale air jet mill and a wet ball mill. The ceria powders were 
crushed by wet mechanical milling for several hours to reduce their sec- 
ondary particle sizes to the target size of 130 nm, after initial mechani- 
cal dry jet milling for several hours to reduce the size to 300 nm. The 
ceria abrasives were then dispersed in deionized water and stabilized by 
adding 100 ppm of a commercially available dispersant (PMAA), along 
with 1 wt% of ammonium salt (Mw = 10,000; Darvan C, R.T. Vanderbilt, 
USA) as another dispersant of the abrasive particles. We also added an 
anionic organic additive (PAA; Polysciences, USA) at a concentration of 
up to 0.80 wt%, with one of three molecular weights (Mw = 30,000, 50,000, 
and 90,000). Each suspension was twice subjected to ultrasonic treatment 
for 15 min to break down agglomerates and promote mixing. An ice bath 
was used to control the temperature of the suspension during the ultra- 
sonic treatment. The suspension was aged for 12 h at room temperature 
with a wrist-action shaker and subjected to ultrasonic treatment for an 
additional 15 min prior to use. The solid content was initially controlled 
to 5 wt% of ceria powder in the suspension. We then diluted each slurry 
with deionized water to produce a final ceria abrasive concentration of 1 
wt%. Each slurry's pH was adjusted to the range of 6.0 to 70 by adding 
an alkaline agent. Table 3.3 lists the slurry characteristics, including the 
slurry pH, the different PAA pH values with the three molecular weights, 
and the experimental conditions during synthesis. 

The crystal structure and grain size were analyzed with a diffractome- 
ter (RINT/DMAX-2500, Rigaku, Japan) using Cu—Ka radiation (A = 0.1542 


TABLE 3.3 
Summary of Slurry Characteristics 
Milling 
Time 
ete (Hours) Chemical Additive Characteristics 
Calcination 
Slurry Temperature Concentration Molecular 

Sample pH (°C) Dry Wet (wt%) Weight pH 
A 30,000 


B 800°C 4 35 50,000 

9.0 0 ~ 0.57 6.5 
C 90,000 
D 


700°C 2 32 30,000 
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nm) at a scan rate of 2° min! (28 min’). The intensity was logged over a 
20 range of 25°C ~ 60°C with a scan step of 0.02*C. The grain sizes of the 
calcined powders were estimated using an x-ray line broadening method 
by applying the Debye-Scherrer equation. The morphology of the abra- 
sives was observed with a high-resolution transmission electron micro- 
scope (HRTEM; JEM-2010, JEOL, Japan). The secondary particle size in 
each slurry was measured by acoustic attenuation spectroscopy (APS-100, 
Matec Applied Sciences, USA). Each suspension pH was measured with 
an advanced benchtop pH meter (Orion-525A, Thermo Orion, USA) by 
adding KOH and HCI to control the range of 7.0 to 8.0. The rheological 
behavior of the slurry suspensions was examined with a controlled-stress 
viscometer (MCR300, Paar Physica, Germany). This viscometer has con- 
centric-cylinder geometry, enabling us to investigate the stability behavior 
of the slurry with an external-temperature-control bath circulator operat- 
ing at 25+0.1°C. 

For blanket wafer tests, we used conventional 8-inch silicon wafers pre- 
pared by the single-side polishing method. SiO, films were deposited by 
the PETEOS method. Si,N, films were formed by low-pressure chemical 
vapor deposition (LPCVD). For the patterned case, the SKW-3 pattern 
wafer designed by SKW Associates was used for characterization with 
respect to the pattern density and pitch size. The STI mask consisted of 
4mm x 4mm density and pitch structures dividing the 20 mm x 20 mm 
die into five rows and five columns. Figure 3.22 illustrates the specially 
designed layout of the SKW-3 pattern wafer, including (a) the pattern den- 
sity and pitch size layout, (b) the mask floor plan, and (c) a cross-sectional 
view. The thicknesses of the as-deposited SiO, and Si,N, films on the blan- 
ket and SKW-3 patterned wafers were 700 nm and 150 nm, respectively. 
In the density structure (where density is defined as “trench width (TW)/ 
[trench width (TW) + active width (AW)]” or the trench area over the total 
area), the pattern density is varied systematically from 0% to 100% in 
increments of 10%, with a fixed pitch of 100 pm. The density structures are 
fabricated in a random layout in order to place high-density regions next 
to low-density regions. In the pitch structure, the density is fixed with 
the same trench width and space (50%), and the pitch is varied from 1 to 
1000 um, with vertically oriented lines. A cross-sectional pattern image of 
active SiN, and field SiO, layers before and after polishing were observed 
by scanning electron microscopy. 

For the CMP process, the films were polished on a Strasbaugh 6EC 
polisher, with an IC1000/Suba IV stacked pad (Rodel, USA). The polish- 
ing pressure, applied as a down force, was 4 psi, equivalent to 275 kPa. 
The relative velocity between the pad and the wafer was 0.539 m/s. The 
polishing time was 30 s. The SiO, and Si¿N, film thickness variations of 
the wafers before and after CMP were measured with a NanoSpec 180 
(Nanometrics, Milpitas, California) and an Opti-probe (Therma-Wave, 
Fremont, California). Cross-sectional images of the SKW-3 patterned 
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FIGURE 3.22 Specially designed layout of the SKW-3 pattern wafer: (a) pattern density and 
pitch size layout, (b) mask floor plan, and (c) cross-sectional view. 
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FIGURE 3.23 Schematic process flow of a typical STI CMP process. 


wafers were obtained with a high-resolution scanning electron micro- 
scope (HRSEM; EP-1040, Hitachi, Japan). To analyze the surface roughness 
of the SiO, and Si,N, films, an area of 1.0 um x 1.0 um was character- 
ized with a commercial multimode atomic force microscope (AFM; XE 
150, PSIA, Korea). A typical shallow trench structure was used to isolate 
the active regions where devices would be fabricated. The Si,N, layer was 
attended, and a shallow trench was etched into the silicon, as illustrated 
in Figure 3.23. A SiO, film was then deposited into the trench, resulting in 
an overburden of SiO, above the Si,N, active areas. In the ideal STI CMP 
process in mass production, the SiO, film is roughly removed in all local 
step-coverage regions, leaving SiO, film only in the trench regions. Fumed 
silica slurry was used for the first CMP step, while the ceria slurries were 
used for the second CMP step in the polishing process to achieve stopping 
on the Si,N, film surface after complete removal of the SiO, film. 

Figure 3.24 shows the HRTEM images and XRD powder diffraction pat- 
terns of the abrasive particles calcined at two temperatures (700°C and 
800°C). These images indicated that the primary grain size increased 
with calcination temperature, and that the morphology of the ceria par- 
ticle varied according to the calcination temperature. The abrasives 
calcined at 700°C showed a relatively low crystallinity, whereas those 
calcined at 800°C exhibited a relatively high crystallinity and the shapes 
of the grain are well defined, though some grains seem to contain sub- 
grain boundaries inside. In the ring-shaped diffraction, the particles cal- 
cined at temperatures as low as 700°C still maintain their crystallinity as 
shown in Figure 3.24a. This result coincides with the XRD peaks shown 
in Figure 3.24b. In addition, the slurry calcined at 700°C contained both 
medium-sized particles and many small primary particles, whereas the 
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FIGURE 3.24 HRTEM photograph and XRD powder diffraction patterns of the abrasive 


particles in a ceria slurry: (a) HRTEM photograph and diffraction pattern and (b) XRD pow- 
der diffraction pattern as a function of calcination. 


66 Nanoparticle Engineering for Chemical-Mechanical Planarization 


other slurry was composed of uniformly distributed, medium-sized par- 
ticles. As confirmed by the TEM images, the slurry calcined at 700°C had 
a wider size distribution than that of the other slurry. The XRD patterns of 
the powders calcined at different temperatures is shown in Figure 3.24b. 
Broader intensity peaks were observed for the ceria powders, which were 
synthesized at 700°C. This result may be considered by the low crystallin- 
ity with unreacted cerium carbonate and small-sized abrasives grain. 

The diffraction pattern only shows the peaks of cerium oxide with a flu- 
orite structure; those for other compounds, such as cerium carbonate and 
cerous oxide, were not detected. With increased calcination temperature, 
the characteristic peaks of CeO, became sharper because the grains of 
single crystals were proportionally grown by heat treatment. This result 
affected the average grain size of the particles. The primary grain size 
of CeO, was investigated to clarify the relationship between the calcina- 
tion temperature and the physical characteristics of the particles. The line 
broadening of the (111) peak in XRD was analyzed to confirm the primary 
grain size of particles. The intensity peak at 20 = 28.2°C was chosen for 
calculating the grain size, since it was clearer than any other peak and 
isolated from the others. The grain size moderately increased overall from 
27 to 36 nm as the calcination temperature was increased from 700°C to 
800°C, which can be attributed to thermally promoted grain growth dur- 
ing the calcination process. These results are in agreement with the trend 
of increasing grain size in the TEM images shown in Figure 3.24a. 

Figure 3.25a shows the distributions of the secondary particle sizes for 
both slurries without PAA addition. There was no difference in the distri- 
bution for small particle sizes of 0 to 0.6 um. On the other hand, the slurry 
calcined at 700°C had a distribution with a higher range of large particles 
(>3 um) than the other slurry. Figure 3.25b shows the median sizes (ds) of 
the abrasives in each slurry as a function of the PAA concentration. With 
increasing PAA concentration, the average secondary particle size gradu- 
ally increased within the concentration range from 0 to 0.60 wt%. 

The average secondary particle size of the polycrystalline abrasives in 
ceria slurry is thought to be determined predominantly by PAA adsorp- 
tion on the abrasives particle in the ceria slurry suspension. Generally, 
the amount of anionic PAA adsorbed on the abrasive particle surfaces, 
the configuration of the adsorbed PAA molecules, and the electric surface 
charge adsorbed from the particles by the PAA polymer chains control the 
agglomeration state and the stability of the dispersion. 

To evaluate the effects of the primary size of the ceria abrasives and 
the PAA concentration with different molecular weights on STI CMP, we 
conducted blanket wafer tests and measured the removal rates of SiO, and 
Si¿N, for the three slurry samples. Figure 3.26 show the results obtained 
from matrix experiment conducted by varying the molecular weights and 
concentrations of the PAA, along with the primary size of the ceria abra- 
sives in each slurry. The removal rate of SiO, was reduced with increasing 
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FIGURE 3.25 (a) Abrasive particle size distribution without surfactant addition and (b) 
average median (d;o) abrasive size as a function of the surfactant molecular weight at pH 
6.5 to 7.0. 
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molecular weight for the same primary size throughout the experimental 
range of PAA concentrations, as shown in Figure 3.26a. For the PAA with 
the highest molecular weight with a different primary size, however, the 
removal rate of SiO, film was markedly reduced, from 2184 to 537 A/min, 
as the PAA concentration increased. In contrast, in the case of the PAA 
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FIGURE 3.26 Results of the CMP tests of blanket wafers in terms of the surfactant molecu- 
lar weight: (a) removal rate of SiO, film and (b) removal rate of SizN, film. 
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with the lowest molecular weight, the removal rate only slightly reduced, 
from 2542 to 1901 A/min. Hence, with increasing PAA concentration, a 
higher primary abrasive size maintained a higher removal rate of SiO, at 
the same molecular weight. The removal rate of Si;N, film versus the PAA 
concentration for slurries with the three PAA molecular weights and the 
two primary abrasive sizes is shown in Figure 3.26b. Kang et al. (2004) 
previously reported that the contact probability of the abrasives on the 
film surface should strongly influence the removal rate. The passivation 
layer is formed by PAA adsorbed on the film surface during CMP, and 
that the effectiveness of this layer may depend on the amount of selec- 
tive adsorption on the film surface and on the concentration of PAA with 
increased molecular weight. Furthermore, we have attributed this to 
the behavior of abrasives moving in the PAA adsorption layer near the 
film surface. The removal rate of Si,N, film was markedly reduced with 
increasing molecular weight, and it essentially saturated beyond a PAA 
concentration of 0.30 wt%. In addition, as a result of increasing the PAA 
concentration from 0.1 to 0.3 wt%, the slurries whose PAA had a medium 
or the lowest molecular weight maintained higher removal rates of SizN, 
film than did the slurry whose PAA had the highest molecular weight. 
In other words, with increasing PAA concentration and the addition of 
PAA having the same molecular weight, the removal rates of SizN, film 
for all slurries were markedly reduced, and they very quickly saturated 
at a higher molecular weight. By comparing Figures 3.26a and b, we can 
calculate the removal selectivity of the SiO,-to-Si,N, films. For the highest 
PAA molecular weight (MW = 90,000), the selectivity increased approxi- 
mately from 10:1 to 13:1 with increasing PAA concentration. For the low- 
est molecular weight (MW = 30,000), however, the selectivity increased 
approximately from 4:1 to 51:1. 

To clarify these results, the slurry samples were used in the STI pla- 
narization step for actual patterned wafers. Figure 3.27 shows the 
removed amounts of SiO, and Si¿N, films versus the pattern density of 
the patterned wafer for different PAA molecular weights and primary 
abrasive sizes. The SiO, film was fully overpolished with increasing pat- 
tern density, as shown in Figure 3.27a. The removed amount of Si¿N, film 
increased with increasing pattern density throughout the experimental 
range of PAA molecular weights on the concentration of 0.42 wt%, as 
shown in Figure 3.27b. In addition, as contrasted with the blanket wafer 
tests, with a higher PAA molecular weight and addition of the same PAA 
concentration, the removed amount of Si¿N, film in active regions was 
gradually reduced for all slurries. With a low PAA molecular weight, 
however, a smaller primary abrasive size maintained a higher removed 
amount of Si;N, film at the same PAA molecular weight and concen- 
tration. Kang et al. (2004) reported that the abrasive size influences the 
effect of the PAA on the removal rate of a ceria slurry. They explained this 
result by using a model with the layer of PAA adsorbed or segregated on 
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the film surface: larger abrasives are more likely to penetrate the viscous 
layer of adsorbed PAA, contact the hydrated surface, and form covalent 
bonds like Ce-O-Si on the film surface. According to this mechanism, 
the particle size determining the possibility of penetrating the viscous 
layer, contacting the hydrated film, and removing the film surface is one 
of the most important factors affecting the removal rate. As the particle 
size decreases, therefore, the removal rates also decrease. On the other 
hand, with many small particles remaining in the slurry suspension, 
whose surface areas are so large as to easily cause greater adsorption of 
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FIGURE 3.27 Results of the CMP tests of patterned wafers in terms of the surfactant molec- 
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PAA molecules in the slurry, the Si,N, film can easily be removed because 
of the PAA adsorbed insufficiently on the densely separated Si¿N, film 
surface on a patterned wafer. 

Figure 3.28 shows cross-sectional SEM images of the 5-um pitch size 
with the density fixed at 50%, illustrating the edges of active SizN, and 
trench SiO, layers before and after polishing. With a higher PAA molecu- 
lar weight and the same PAA concentration, the removed amount of Si¿N, 
film for all three slurries was gradually reduced with the narrow pitch 
size of 5 um. The Si¿N, film erosion was clearly less for the PAA with 
the highest molecular weight, as compared to that for the low molecular 
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FIGURE 3.28 (See color insert) Pre- and post-CMP cross-sectional SEM micrographs: (a) 
pre-CMP, (b) slurry A_large (Mw 30k), (c) slurry B_large (Mw 50K), (c) slurry C_large (Mw 
90K), and (d) slurry D_small (Mw 30K). 
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weight. Hence, at the same molecular weight, a smaller primary abrasive 
size maintained higher erosion of Si¿N, film than with a large primary 
size. The amount of PAA adsorption on the smaller particles was much 
higher than that on the larger particles because of their higher specific sur- 
face area, resulting in extra consumption of the PAA in the slurry solution. 
By comparing the images before and after CMP, we could calculate the 
amount of SizN, erosion. In this study, we also confirmed that the order of 
the measured SizN, film erosion (90K [large] < 50K [large] < 30K [large] < 
30K [small]) did not change with respect to previous experimental results 
in this region with a low density of field SizN,. These results are in good 
agreement with the Si¿N, film erosion shown in Figure 3.27b. 

Figure 3.29a shows AFM line scan measurements indicating that a sig- 
nificant amount of SiO, local dishing occurred with overpolishing in a 
500-um-wide region. The dishing was reduced with increasing PAA 
molecular weight for the wide-field SiO, isolation region of 250 um. Yu 
et al. (1992) explained the mechanism of the dishing effect. For a narrow 
field width, the pressure exerted on the field SiO, is significantly reduced 
when the interface between the SiO, and Si,N, films is reached in the CMP 
process because the pressure applied by the pad is now concentrated on 
the SizN, layer as a result of its lower removal rate (about seven times lower 
than that of the field SiO,). In the wide-field region, the reduction in the 
local pressure is far less significant because of the elasticity of the pad, 
resulting in continued polishing of the field SiO, after the film interface 
(i.e., between field SiO, and active SizN,) is reached during CMP, so that 
the wider the field region, the smaller the reduction in the pressure act- 
ing on the field SiO,, and the greater the degree of dishing. The dishing 
of the field SiO, was significantly lower because of the higher molecular 
weight of PAA in the ceria slurry, as shown in Figure 3.29a, which means 
that the PAA was more tightly adsorbed on the SiO, film because of the 
chain length and the chain bridging effect for the higher molecular weight 
than for the lower molecular weight. It was found that the surface rough- 
ness of the active region was much higher for the PAA with the highest 
molecular weight than for that with the low molecular weight, as shown 
in Figure 3.29b. The surface roughness of the active SizN, region became 
worse with a higher molecular weight and the same primary abrasive size 
and PAA concentration, while a low value for the surface roughness of 
the field SiO, region was maintained. We previously reported, according 
to AFM analysis, that the adsorption of anionic PAA is attributed to the 
formation of a PAA adsorption layer on the SiN, film, as a result of the 
electrostatic interaction between the PAA and the film surface. 

The similar adsorption behavior of the PAA on the active Si¿N, films 
with a pattern density of 10% can be further characterized by the AFM 
images shown in Figure 3.30. The morphology and surface roughness 
dependencies after polishing of the active SizN, region on the different 
PAA molecular weights and primary abrasive sizes were observed. In the 
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FIGURE 3.29 (a) Post-CMP surface line scans of the wide-field SiO, region; (b) post-CMP 
RMS surface roughness values. 
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case of the different primary sizes, there was no significant change in the 
film surface between the Si,N, active surface and the adsorbed PAA after 
polishing. On the other hand, it was found that the surface roughness of 
the post-CMP Si¿N, film for the PAA with the highest molecular weight 
was much higher (0.355 nm) than that for the lowest molecular weight 
(0.280 nm), as illustrated in Figures 3.30a and 3.30d. This is attributed to 
the formation of the PAA adsorption layer on the SizN, film because of 
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FIGURE 3.30 (See color insert) Post-CMP three-dimensional AFM micrographs of pat- 
terned wafers: (a) slurry A_large (Mw 30K), (b) slurry B_large (Mw 50K), (c) slurry C_large 
(Mw 90K), and (d) slurry D_small (Mw 30K). 


the electrostatic interaction between the PAA and the film surface. These 
results are in good agreement with the AFM line scan measurements 
shown in Figure 3.29. 

The adsorption behavior of the PAA on the ceria particles was mainly 
caused by the different surface charges between the PAA and the ceria 
surface. The PAA is an anionic polyelectrolyte with an acidic carboxyl 
group, which leads to the ionization of the PAA molecules in the neutral 
pH region at which the ceria slurry for STI CMP is usually manufactured. 
Meanwhile, the net surface charge of the ceria particle is near zero in this 
pH region because the pH;., of ceria is approximately 6 to 7. Thus, the par- 
tially or fully ionized polyelectrolyte (PAA) is adsorbed on the ceria sur- 
face by electrostatic interactions. The electrostatic attractive force between 
adsorbed PAA molecules on the water-particle interface and the SiN, 
film surface can be classified as mainly resulting from the electrostatic 
interaction of the electric double layer surrounding the particles and the 
steric hindrance effect of the adsorbed PAA molecules on the Si;N, film. 
Since the interaction of the electric double layer may have increased in 
proportion to the surface potential of absorbed PAA molecules with the 
same counterion content, the change in the zeta potential is important 
for the dependence of the electric double layer on the suspension proper- 
ties. Moreover, the oxide film and the surface-modified ceria particles are 
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negatively charged above pH 3, while the Si,N, film is positively charged 
below pH 7. The attraction or repulsion between the abrasive particles 
and films (SiO, and Si;N,) results from the different electrostatic poten- 
tials exhibited in certain pH regions. Hence, during the blending of the 
slurry and additive solution, PAA that is used to form a passivation layer 
on the Si¿N, film can be additionally adsorbed on the surface of the ceria 
particles, which are basically covered by the same organic additive acting 
as the dispersant. This phenomenon could be explained as follows: the 
repulsive interaction between adjacent carboxyl sites is generated through 
the addition of more polymers, which then resulted in the conformational 
change of the adsorbed polymer. In addition, the carboxylic acid group 
appear to be necessary to suppress the SiN, removal rate during CMP 
process through hydrogen bonding between Si,N, film and carboxylic 
group in amino-acid-based ceria slurry. The electrostatic interactions 
between the abrasive particles in each slurry and the film surfaces, how- 
ever, may not fully explain the suppressed removal rate of Si;N, film and 
the removal selectivity of SiO,-to-Si,N, films with different PAA molecu- 
lar weights. Hence, it is necessary to consider other factors that influence 
the abrasive movement in a slurry, from the point of view of rheologi- 
cal behavior. These factors depend on the passivation layer of PAA at the 
interface between the film surface and the ceria slurry suspension. 

Figure 3.31 shows the rheological behaviors of various ceria slurries, 
with a fixed PAA pH of 7.0, as a function of the PAA concentration and 
molecular weight with different primary abrasive sizes. As shown in 
Figure 3.31a, for the PAA with the higher molecular weights, the slurry 
viscosity increased markedly with the PAA concentration, but it barely 
increased with the concentration in the case of the low molecular weight, 
regardless of the abrasive size. The primary abrasive size made no differ- 
ence in the average slurry viscosity for the two different primary sizes. The 
measured effects of the PAA concentration and molecular weight on the 
slurry viscosity are in good agreement with results previously reported. 
For the same weight concentration of ceria abrasives, the number of mol- 
ecules in a ceria suspension with a higher molecular weight PAA will be 
lower than that with a lower molecular weight. 

According to the Mark-Houwink-Sakurada equation, the relationship 
between the viscosity, molecular weight, and organic polymer type can 
be formulated as 


[ni] = Km Me 64 


where ni is the intrinsic viscosity, a and Km are constants for a specific 
polymer solvent system, and M, is the average molecular weight of the 
polymer. The constant Km depends on primary molecular features, such 
as the persistence length, while a depends on short-range interactions and 
their implied effect on the molecular weight. For each slurry with a range 
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FIGURE 3.31 Experimental slurry viscosity with surfactants of different molecular weights: 
(a) average slurry viscosity as a function of the surfactant concentration and molecular 
weight, and (b) intrinsic viscosity calculated from eqs. 1 and 2. 


of PAA molecular weights, the relation between the intrinsic viscosity and 
molecular weight is one of its most important properties. This relation can 
be represented by the following equation: 


In ni = In Km + a ln M, (3.5) 


Figure 3.31b shows the calculated parameters Km and a from a plot of In 
ni versus In Mw. For the constants in Equation 3.5, we chose the values for 
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PAA in a ceria slurry solution, enabling us to evaluate the average molecu- 
lar weight of PAA in this solution. Here, the amount of PAA adsorption 
(or segregation on the surface) depends on the bulk concentration of the 
PAA and the electrostatic interaction between the PAA and the film sur- 
face. Moreover, because a PAA with a higher molecular weight adsorbs 
more densely, the intrinsic viscosity (ni in Equation 3.4), which describes 
the particle behavior near the film surface, should increase and hinder 
the movement of particles. As a result, the removal rates of both the SiO, 
and the Si;N, films were reduced as the molecular weight and the con- 
centration of the PAA increased in the blanket wafer tests. Moreover, the 
removal rates of the SiO, and SizN, films can become important, depend- 
ing on the passivation layer of PAA existing at the interface. Thus, the 
addition of a PAA with a lower molecular weight appears to passivate 
the electrostatic interactions, thereby resulting in weaker adhesion of the 
adsorbed PAA layer through polymer chain bridging and branching, and 
possibly resulting in desorption of this layer above a certain applied load 
during the CMP process. As the PAA chain length increases, however, 
the lateral interaction among the hydrocarbon chains becomes more pro- 
nounced, resulting in the formation of a more effectively passivated layer 
of PAA. Consequently, with increasing PAA concentration and addition at 
a higher molecular weight, the SiN, removal rates for all slurries mark- 
edly reduced in the blanket wafer tests. Although in the case of a higher 
PAA molecular weight, the removal rate and erosion of Si;N, film could 
be reduced, and for a patterned wafer, the removal rate of the field SiO, 
film was also reduced, while the surface roughness of the Si,N, film in the 
active region was increased. 

Forthe PAA with the highest molecular weight in our experiments using 
blanket wafer, with different primary abrasive sizes, the removal rates of 
the SiO, and Si¿N, films were markedly reduced as the PAA concentration 
increased. Hence, with increasing PAA concentration, a higher primary 
abrasive size maintained a higher removal rate of SiO, at the same PAA 
molecular weight and concentration. For the case of patterned wafers, 
with a higher PAA molecular weight, the erosion of SizN, film could be 
reduced, but our pattern wafer tests showed that the removal amount was 
reduced and the surface roughness of the Si;N, film became worse. These 
results can be qualitatively explained from the layer of PAA adsorbed on 
the film surface in terms of electrostatic interaction and rheological behav- 
ior, including the molecular weights, concentrations of PAA, and different 
primary abrasive sizes in the ceria slurry. 
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FIGURE 1.5 CMP process of manufacturing. 
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FIGURE 2.1 Schematic of ILD CMP process. 


FIGURE 2.7 Analysis of remaining silica particles (particle size > 0.189um) on silicon wafers 
after post CMP cleaning; (left) modified slurry, (right) nonmodified slurry. 
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FIGURE 2.16 Wafer level variation for tool A. 
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FIGURE 2.17 Wafer level variation for tool B. 
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FIGURE 2.18 Die variation for tool A. 
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FIGURE 2.19 Die variation for tool B. 
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FIGURE 3.1 STI CMP process. 
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FIGURE 3.2 The effect of overpolishing on the characteristics of the device: (top) the decrease 
of gate length by overpolishing; (bottom) the shift of threshold voltage in the device. 
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FIGURE 3.6 TEM images and nano beam diffraction. 
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FIGURE 3.28 Pre- and post-CMP cross-sectional SEM micrographs: (a) pre-CMP, (b) slurry 
A_large (Mw 30K), (c) slurry B_large (Mw 50K), (c) slurry C_large (Mw 90K), and (d) slurry 
D_small (Mw 30K). 
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FIGURE 3.30 Post-CMP three-dimensional AFM micrographs of patterned wafers: (a) 


slurry A_large (Mw 30k), (b) slurry B_large (Mw 50K), (c) slurry C_large (Mw 90K), and (d) 
slurry D_small (Mw 30K). 
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FIGURE 4.10 Potentiodynamic polarization: (a) various alanine concentration, (b) 1.5 wt% 
alanine with and without PAM. 
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FIGURE 5.3 Topology map of wafer surface. 
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FIGURE 5.4 Basic concepts of soft and hard polishing pads. 
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FIGURE 5.6 A comparison of SSP and DSP mounting techniques and how these affect 
nanotopography and flatness. 
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FIGURE 5.21 Schematic structures of a 63-nm NAND-flash memory-cell with different 
floating-gate heights, induced by the wafer nanotopography of 10-mm-diameter scanning: 
(a) 45-nm height (free of wafer nanotopography influence); (b) 30.1-nm height (at the top 
of wafer nanotopography influence); (c) 59.9-nm height (at the bottom of nanotopography 


influence). 
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FIGURE 5.22 Programming operation of a NAND-flash memory-cell, where the program- 
ming cell is No. 5: (a) voltage bias conditions for memory cells and (b) parasitic capacitances 
during No. 5 cell programming. 
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FIGURE 6.9 Schematic process flow of the poly isolation CMP process. 


$ s 4 4 


© © 0.0 0°? Pee e, 
Amorphous Si Deposition 


Si Substrate 


FIGURE 6.10 Mechanism for the poly isolation CMP process. 
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Copper CMP 


==] 
4.1 Introduction 


As design technology and manufacturing process technology of ultra 
large scale integration (ULSI) technology are improved, integration 
doubles every three years. As the design rule of semiconductor devices 
decreases below 100 nm, device is becoming high integration and multi- 
layer metallization of circuits. Especially as the feature size of the transis- 
tor is decreased below 130 nm, the device operation should be improved. 
However, parasitic capacitance that is caused by the reduction of the gap 
between metal lines increases, which leads to the decline of device opera- 
tion. Therefore, copper (Cu) wire is used instead of aluminum (Al) wire to 
solve this problem. 

In the case of microprocessor, if tungsten (W) plug and aluminum metal 
line is used, metal wiring of 10x12 layers is need. However, the application 
of copper wiring enables 6x8 metal layers. This can prevent the opera- 
tion speed of device by the use of low-k dielectric to insulator of metal 
wiring instead of silicon oxide (SiO,), because copper has low resistivity 
and superior electromigration. These can be explained by the delay time 
equation: 


delay time « RC œ potkeo (4.1) 


where, R is resistance of metal wiring, and C is parasitic capacitance 
between metal wiring, po is resistivity of metal wiring, £ is the length of 
metal wiring, kis the permittivity of insulator between metal wiring, and 
eo is vacuum permittivity. 

If the design rules of a semiconductor device gets into 0.1 um low, element 
delay time is expected to be decreased with decreasing of gate. However, 
parasitic capacitance by the reduction of space length between Al metal 
wiring increases rapidly and delay time of the device increases rapidly. To 
solve this problem, the metal line/insulator substitutes Cu/low-k for Al/ 
SiO,, which results in a minimization of RC delay-time (Figure 4.1). 
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FIGURE 4.1 (a) Required delay time and (b) permittivity by design rule. 


Barrier metal such as Ta or TaN prevents the diffusion of Cu because Cu's 
solubility and diffusivity are high when wire that uses Cu is required. And 
also because Cu and low-k material are very soft, there is difficult point 
that CMP process must perform under low shear stress and pressure. 
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Because Cu-to-TaN removal selectivity is not high, dishing or erosion 
happens. As these phenomenon displays become serious, the tendency 
aspect ratio does increase. Therefore, Cu-to-[aN removal selectivity 
is required more than a given level in CMP. Because more than 20:1 of 
removal selectivity is required, dishing does not happen. Also, to prevent 
erosion, 1:1:1 of Cu:Ta:low-k removal selectivity is required as shown in 
Figure 4.2 and Figure 4.3. Dishing or erosion, which is caused after Cu 
CMP, can have an adverse effect on device operation, such as an increase 
of resistance. 
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FIGURE 4.2 (See color insert) Two-step process of Cu CMP. 
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FIGURE 4.3 Cu CMP roadmap. 
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4.2 High Selectivity for Copper CMP 


Cu CMP slurries commonly use submicron-sized colloidal silica abrasive 
particles dispersed in aqueous solutions that contain an oxidizer, as well 
as an complexing agent and corrosion inhibiting agents and other chemi- 
cals. Most of the slurries described in the article by several researchers 
use H,O, as the oxidizer and benzotriazole as the inhibitor, with various 
complexing agents such as organic polymer, alkaline agent, and organic 
amine in slurry. 

The pH value of the polishing slurry is one of the most important 
parameters influencing the polishing rate, surface roughness, and other 
performance characteristics of the Cu CMP process. In this section, the 
slurry’s pH and conductivity were adjusted to the range of pH 10 to 11 and 
conductivity of 8 to 10 (mS/cm) by adding an alkaline agent, including 
NH,OH and HCI solution. 

Figures 4.4 to 4.7 show the results obtained from an experiment con- 
ducted by varying the concentrations of complexing agent (alanine) and 
selectivity control agent (PAM) in aqueous slurry. Figure 4.4 shows the 
removal rate of Cu and TaN films versus the alanine concentration. The 
removal rate of Cu film increased with alanine concentrations. In addition, 
the removal rate of TaN film strongly suppressed and slightly increased 
with increasing alanine concentrations in aqueous suspension. As with 
the removal rate of Cu film, the removal rate of TaN film drastically 
decreased and was essentially saturated with a concentration of alanine 
beyond 0.5 wt%. 
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FIGURE 4.4 The removal rates of Cu and TaN films versus alanine concentration in slurry. 
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Alanine could exist in aqueous solution in three different forms, namely, 
CH3 CH(NH3+)COOH (cation), CH3 CH(NH3+)COO- (zwitterions), and 
CH3 CH(NH2)COO- (anion). These species are denoted as H2L+, HL, 
and L-, respectively, for brevity. The equilibrium between these may be 
depicted, as Babu et al. (2005) previously reported, the dissolution and 
removal rate probability of the complexing agent, including phthalic acid, 
citric acid, glycine, oxalic acid, and carboxyl and/or amine functional 
group, which interact on the Cu film surface should strongly influence the 
removal rate. 


pkal = 2.35 
CH3 CH(NH3*) COOH e CH3 CH(NH3*) COO: 
(H2L*) (HL) 
cation zwitten on 
(4.2) 
pka2 = 9.87 
«e CH3 CH(NH 2) COO: 
(Lo) 
anion 


Babu et al. explained that a complexing agent, such as amino group in 
glycine and hydrogen peroxide system, is protonated at pH <4.0, and thus 
may not effectively form chelates with positively charged metal ions; thus, 
the dissolution must be due to the carboxyl group. On the other hand, at 
pH >4.0 the amino group can chelate Cu2+ ion and cause the dissolution 
of the metal up to pH 10. However, alanine and H,O,-containing colloidal 
silica slurry exhibited an enhanced removal rate of Cu film at alkaline pH 
region. We thought that the alanine could be a very effective complexing 
agent with an increased removal rate of Cu film through a high dissolu- 
tion rate of Cu2+ ion in alkaline pH region. 

The suppression of the removal rate of TaN film could not be fully 
explained through the electrochemical phenomena by chemical reaction 
between complexing agent and the TaN film surface. We thought that the 
TaN film loss and the Cu-to-TaN removal selectivity are directly related 
to the electrostatic interaction and electrokinetic behavior due to chemical 
adsorption and steric hindrance of adsorbed organic chemical. 

Figure 4.5 shows the electrokinetic behaviors of Cu film, TaN film, and col- 
loidal silica slurries with alanine addition as a function of pH. The absolute 
surface zeta potential of the Cu film was slightly negatively charged above 
pH 5. The TaN film also exhibited a slightly negative charge at a pH above 
pH 5.3. Colloidal silica slurry with alanine exhibited a pH. at pH 4.0. 

The surface potentials of the colloidal silica abrasive particles in the 
aqueous suspension with alanine were strongly negatively charged above 
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pH 4.0, while the TaN film's surface potential was weakly negatively 
charged. The attraction behavior between the abrasive particles and the 
TaN film results from the different electrostatic potentials exhibited in 
certain pH regions. Therefore, we suggest that the selective adsorption of 
alanine added slurry on the abrasive particles and the TaN film surfaces 
correspond to the differing zeta potential charge. The removal rate of TaN 
film is drastically supressed, and increased slightly by this difference. 

By comparing the removal rate of Cu and TaN films, we can calculate 
the removal selectivity of Cu-to-IaN films (Figure 4.6). By increasing 
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FIGURE 4.5 Zeta potential of Cu, TaN films, and colloidal silica slurry with alanine as a 
function of pH. 
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FIGURE 4.6 Removal selectivity of Cu-to-TaN films versus alanine concentration in slurry. 
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FIGURE 4.7 The removal rates of Cu and TaN films versus PAM concentration in slurry. 


the alanine concentration, the removal selectivity drastically increased 
and essentially saturated from 5:1 to 32:1 with increasing alanine 
concentration. 

Figure 4.7 shows the removal rate of Cu and TaN films versus the PAM 
concentration. The removal rate of Cu film slightly decreased with PAM 
concentrations. Here, the removal rate of TaN film was strongly sup- 
pressed and saturated with increasing PAM concentrations in aqueous 
suspension as shown in Figure 4.7. 

To enhance the removal selectivity of Cu-to-TaN films with suppressing 
the removal rate of TaN film by selective adsorption, we also optionally 
added organic polymer (PAM) with the concentration of up to 0.7 wt%. 
The adsorption of PAM-added slurry on the abrasive particles and the 
film surfaces corresponds to the differing zeta potential charge. By this 
zeta potential difference, the removal rate of the Cu and TaN films was 
more suppressed, and the oxide-to-nitride removal selectivity increased 
with addition of PAM. 

Figure 4.8 shows the electrokinetic behaviors of Cu film, TaN film, and 
colloidal silica slurries with PAM addition as a function of pH. 

Adsorption of PAM on Cu and TaN film surfaces increases and reaches 
a strong suppressed point of approximately 0.3 wt%. In other words, PAM 
is more adsorbed on the Cu and TaN film surfaces. This is driven by the 
difference in zeta potential, which affects the interaction between PAM 
and each surface. 

In addition, above the isotropic point, the slightly negative-charged Cu 
oxide and TaN films surface can interact with the deprotonated between 
carboxyl groups of alanine, neutral -NH, groups, and NH+ functional 
groups of PAM, which results in the formation of strong complexes with 
Cu and TaN films. However, with addition of PAM, the removal rate of 
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FIGURE 4.8 Zeta potential of Cu, TaN films, and colloidal silica slurry with PAM as a func- 
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FIGURE 4.9 Removal selectivity of Cu-to-TaN films versus PAM concentration in slurry. 


Cu and TaN film decreased. By increasing the PAM concentration, the 
removal selectivity drastically increased and essentially saturated from 
30:1 to 130:1 (Figure 4.9). 

Potentiodynamic polarization studies were carried out to measure the 
corrosion current density and potential at various alanine and polyacryl- 
amide concentrations with H,O,. Polarization plots for Cu film asa function 
of alanine concentration with H,O, at pH 10 are presented in Figure 4.10a 
and b, respectively. The value of corrosion potential reduced gradually 
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FIGURE 4.10 (See color insert) Potentiodynamic polarization: (a) various alanine concen- 
tration, (b) 1.5 wt% alanine with and without PAM. 


with an increasing concentration of alanine (Figure 4.10a). As the alanine 
solution increases from 0.5 to 2.0, the fraction of carboxyl and amino func- 
tional group is more pronounced with increased anion fraction at alkaline 
pH. Since the bidentate (L—) is more reactive than the monodentate (HL), 
the increased dissolution rate of Cu* or Cu” ions. The removal rate of Cu 
film increases with an increasing alanine concentration by low corrosion 
potential. On the other hand, corrosion potential showed no difference 
with addition of PAM solution (Figure 4.10b). 
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4.3 Copper CMP Pattern Dependence 


The use of copper as an interconnect material in multilevel scheme DRAM 
and NAND flash memory in design rules beyond 45 nm is being increas- 
ingly considered mainly due to copper's low resistivity and high resis- 
tance to electromigration compared to the widely used aluminum (Al) 
alloys. As the device structure becomes more complicated and its dimen- 
sions shrink, the conductors on the chip must be thin enough to occupy 
less space. Such miniaturization of the conductor causes an increase in 
the RC delay, which is the product of the metal resistance (R) and the 
capacitance (C) of the interlevel dielectric. While the low resistivity of 
copper is expected to exhibit lower interconnect delay, high resistance to 
electromigration enhances the device reliability by increasing the mean 
time to device failure. In addition, in the manufacture of memory circuit 
devices with copper metallization, multilevel interconnects are formed 
using, the damascene method, whereby copper is deposited by chemical 
vapor deposition (CVD) or electroplating into vias and trenches etched in 
the interlayer dielectric (ILD) over a diffusion barrier usually made from 
titanium (Ti), tantalum (Ia), or their nitrides. For the application of cop- 
per as interconnect material, the film surface within wafer must be made 
planar on a global scale. Inlaid metal patterns in multilevel chips could be 
obtained by CMP. The CMP is used to planarize the barrier metal, low-k, 
and copper layer following their deposition process. 


4.3.1 Dishing Dependency on Feature Size and Pattern Density 


Dishing of copper lines is among the most important issues of copper 
CMP. Dishing reduces the final thickness of copper lines and degrades the 
planarity of the wafer's surface, resulting in complications when adding 
multiple levels of metal. Understanding of dishing and its nature is help- 
ful in process optimization and in understanding the process mechanism. 
Here, we present a thorough investigation of dishing in copper CMP. 
Along with studying the dependency of dishing on linewidth and pattern 
density, our investigation is focused on the effect of (over)polish time, oxi- 
dizer concentration in the slurry, and thickness of the as-deposited copper 
layer. As a result, a hypothesis of material removal mechanism for our 
type of slurry is presented. 

The test structures were fabricated as following. First, the interconnec- 
tion grooves were etched in thermally grown silicon dioxide by RIE with 
the depth of 600 nm. The width of the trenches varies from 2 to 100 mm. 
Second, after depositing a 50-nm Ti layer by sputtering an adhesion pro- 
moter layer, an 800-nm Cu film was deposited also by sputtering without 
breaking the vacuum. An IC1000/Suba IV stacked-perforated pad from 
Rodel was applied. Prior to every run, pad conditioning was done using 
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diamond tool. An alumina-based slurry (A1,O; particles, median diameter 
200 nm), which contains a complexing (carboxylic acid) agent and a stabi- 
lizer, was used throughout the investigation. Hydroperoxide (functions as 
oxidizer) was added to the slurry at three volume percentages: 7.5%, 10%, 
and 15%. After mixing, the pH of the slurry was measured to be around 
4.0. Polishing runs were carried out using a Presi Mecapol E460 polishing 
tool. Optimum settings for uniformity were applied, which were found 
for our test pattern. The polishing pressure was 250 g/cm?. The rotation 
speed of platen/pad and that of the wafer holder were set at 50 rpm. The 
supply speed of the slurry was 125 ml/min. Due to practical limitations, 
we used a timed polish process. The nominal polish time was determined 
by the moment when the entire wafer surface is clear from excess metal. 
From then on, if the wafer polishing is continued, we consider it as over- 
polishing and the time is called overpolish time. In our study, four polish 
times were used, which are nominal, 5% (of nominal time) overpolish, 
and 10% and 20% overpolish. 

Figure 4.11 shows the dishing amount of copper lines at different lin- 
ewidth and pattern density. The wafer was polished using slurry with 
15% hydroperoxide and the polish time was nominal. As expected, the 
dishing amount strongly increases with the increment of linewidth (see 
Figure 4.11). The dependency is not a linear function of linewidth. At a 
linewidth above 50 mm, the dishing levels off. The pattern density only 
shows a minor effect on copper dishing. Unlike dishing data published 
for other material, such as tungsten, dishing of copper lines appears to 
be relatively large even at nominal polishing. As shown in Figure 4.11, 
the dishing amount of 100-mm wide copper lines is more than 100 nm. 
However, it is well known that the removal rate is higher at dense areas, 
thus when the entire wafer surface is clear, which we defined as nominal 
polishing, dense areas must have been overpolished. In addition, there is 
always a certain non-uniformity of removal rate over the wafer. Therefore, 
when the entire wafer surface is cleared there definitely are areas that have 
been overpolished. We assume that overpolishing with very high removal 
rate of copper (typical 600 nm/min), high selectivity between copper and 
ILD (typical larger than 90), and a too thin as-deposited copper layer are 
the reasons of the large amount of dishing. 

Overpolishing is needed to ensure good electrical properties of inter- 
connection (no shorts between separate interconnect). However, overpol- 
ishing always results in an increasing amount of dishing and worsens 
the planarity of the wafer surface. Figure 4.12 shows the profiles of a test 
structure of 20-mm wide copper lines with a pattern density of 50% at 
nominal polish time and three overpolish times. The dishing increases 
dramatically with increasing overpolish time. As many authors have 
described, dishing as a result of the pad reaching into recess areas and 
removal of copper in the recess, there is a question raised if the dishing 
rate is the same at different linewidths. Therefore, we plotted the dishing 
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FIGURE 4.11 Dishing versus (a) linewidth and (b) pattern density. 


rates of different linewidths versus overpolishing time in Figure 4.13. The 
dishing rate of wide lines is higher than that of narrow ones. To explain 
this, we use the model proposed by Warnock saying that the pad reaches 
into recessed areas by bending and its roughness. The amount that the 
pad can reach into the recessed areas depends on the pad's properties 
(e.g., hardness, surface roughness), linewidth, and applied pressure. Since 
all the other conditions remain the same in our case, linewidth is the only 
factor that can affect the amount of pad reaching. Therefore, it directly 
relates to the amount of dishing as well as to the dishing rate. The model 
is thus in accordance with the obtained dishing rate behavior seen in 
Figure 4.13, that is, the different slopes are explained. 
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FIGURE 4.12 Surface profiles of a polished structure (20 um linewidth, PD = 50%) at differ- 
ent polish times. 
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FIGURE 4.13 Dishing of different linewidths at different polish times. 


Slurry chemistry has been reported to have a strong effect on polishing 
results. We also found a strong dependency of dishing on slurry chemis- 
try. Figure 4.14 illustrates the dishing of 100-, 50-, and 20-mm wide lines 
(pattern density 20%) polished by slurries with different concentrations of 
hydroperoxide at nominal polish time. It can be seen that the dishing is 
reversely proportional to the concentration of oxidizer in the slurry. The 
explanation we propose for this phenomenon is that with higher oxidizer 
concentration in the slurry, a more effective passivation layer is formed on 
the copper surface (it will grow faster). This passivation layer slows the 
removal rate of copper in the recess areas and better protects the copper 
lines from dishing during the overpolish step. We have found that the 
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FIGURE 4.14 Surface profiles of a structure (PD = 20%) polished by slurry with different 
oxidizer concentrations. 
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FIGURE 4.15 Dishing of as-deposited thin and thick copper layers at different linewidths 
and pattern densities. 


thickness of the as-deposited copper layer also affects the amount of dish- 
ing of copper lines. Two copper thicknesses are used to study this depen- 
dency. The thin copper layers with a thickness of 800 nm was described 
earlier. The thick copper layer is 1.5 mm thick. Dishing data for both thick- 
nesses at nominal polish times are shown in Figure 4.15. It is clear that 
the amount of dishing is smaller in all cases for the as-deposited thicker 
copper layers. Furthermore, the effect of thin versus thick copper layer on 
dishing appears to be even clearer at the large features, while only minor 
effects are observed for the small features. It is known that the removal 
rate of copper at dense areas is higher than that of field areas due to higher 
local pressure exerted on the features. Therefore, if the copper layer is too 
thin, the dense areas will be cleared first and experiences overpolishing 
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before the field areas are cleared and global planarity is reached. This 
leads to severe dishing at the dense areas. The copper layer should be 
sufficiently thick to reach a globally planar surface with still excess of cop- 
per on all features (wide, small, densely and widely packed). This will 
minimize the dishing. In this way, we managed to reduce the maximum 
dishing for all investigated feature sizes to less than 100 nm for nominal 
polishing conditions. 

Figure 4.16 shows etch rates of copper in the slurry without abrasive par- 
ticles at three investigated concentrations of oxidizer. We notice that the 
static etch rate of copper is very low (10 nm/min). On the other hand, the 
removal rate of copper during polishing is very high (more than 600 nm/ 
min is achieved). This is strong evidence of the forming of a passivation 
layer on the copper surface in the slurry. Further investigation of the chem- 
istry of the slurry gave us the following hypothesis about the forming of 
the passivation layer on the copper surface and thereafter a proposal for the 
copper removal mechanism in copper CMP with our slurry. The oxidizer 
(H,O,) reacts with Cu in acidic slurry (pH 4) and Cu? ions are formed. The 
anions of the carboxylic acid react with Cu ions (R(COO),Cu). Carboxylates 
of metals other than the alkali metals generally are insoluble. Therefore, 
we suppose that R(COO),Cu protects the copper underneath from etching. 
According to the Pourbaix diagram for the copper-H,O system, no copper 
oxide can be formed in our slurry (the pH of our slurry is about 4.0; at this 
pH, only two forms of Cu” or Cu are possible; see Figure 4.17). The concen- 
tration of H,O, strongly influences the amount of Cu? ions and, therefore, 
the amount of R(COO),Cu product. In other words, the effectiveness of the 
passivation layer is directly proportional to the H,O, concentration in the 
slurry. This is consistent with our experimental results. 
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FIGURE 4.16 Etch rate of copper in the slurry with different oxidizer concentrations. 
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FIGURE 4.17 Pourbaix diagram of the copper—H,O system. 


The hypothesis for the copper removal mechanism using this slurry is pro- 
posed to be as follows. Copper on the surface is oxidized by H,O, (in acidic 
environment) into copper cations. These cations then react with carboxylate 
anions to form the passivation layer that protects copper from etching. This 
layer is then removed at protruding levels by mechanical abrasion. Once 
removed from the surface, the “metallic soap” particles are swept away by 
the turbulent motion of the slurry. Further investigations of the passivation 
mechanism by varying the slurry chemistry are being conducted. 

The dishing behavior of copper lines has been extensively studied. 
Relationships between dishing and feature size, pattern density, overpol- 
ishing time, thickness of as-deposited copper layer, and slurry chemis- 
try have been elucidated. While dishing strongly depends on linewidth, 
only a small effect of pattern density has been observed. As expected, 
dishing dramatically increases with the increment of overpolishing time. 
Furthermore, the dishing rate dependency of overpolish time is not the 
same for all feature sizes. The larger the linewidth, the higher the dishing 
rate is. Thick as-deposited copper layers yield less dishing than thinner 
ones. The oxidizer concentration in the slurry also has a strong impact on 
the amount of dishing. It was found that, within the investigation win- 
dow, the dishing is inversely proportional to the oxidizer concentration. 
From the obtained results, a hypothesis for the passivation layer forma- 
tion has been proposed and the mechanism of copper removal has been 
presented. 


4.3.2 Pattern Effects on Planarization Efficiency 
of Cu Electropolishing 


Cu electropolishing technology has been explored as a replacement of 
the Cu CMP planarization process. Contolini et al. (1994) integrated Cu 
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electropolishing with a wet etching technology in a novel electrochemical 
planarization (ECP) method for Cu multilevel interconnects, and Wang 
(2000) has designed a commercial tool for the electropolishing process. 
In addition, Cheung (1997) proposed a process of Cu electropolishing to 
enhance CMP throughput. Recently, groups at TSMC and SONY companies 
also applied Cu electropolishing in global planarization technologies. 

Traditional electropolishing is an important surface treatment tech- 
nology, and can dissolve a metallic film uniformly and produce a 
smooth and bright surface. Furthermore, electropolishing has potential 
advantages in that it renders a reduced waste stream, is less consumable, 
and there is no applied pressure to the substrate, which is beneficial for 
future low-dielectric-constant-material integrated processes. In previ- 
ous studies of Cu CMP, Steigerwald et al. (1997) found that Cu dishing 
is a strong function of linewidth but is only weakly dependent upon 
pattern density. At the same pattern density, the amount of dishing 
increased as the linewidth increased. In this work, pattern effects of Cu 
electropolishing were discussed. Anodic potentiodynamic polarization 
measurement was also employed to clarify the dissolution mechanism 
of Cu electropolishing. 

The patterned wafer used was composed of a 30-nm-thick ionized 
metal plasma (IMP)-IaN layer as the diffusion barrier, and a 200-nm-thick 
IMP-Cu film as the seed layer. The experiments on Cu electroplating and 
electropolishing were carried out in a tank of nonconducting material at 
room temperature. The counterelectrode was a platinum plate and the 
working electrode was a sliced wafer with a size of 2 x 3 cm?. In Cu elec- 
troplating, the electrolytes included CuSO, - 5H,O (30 g/L), H,SO, (275 
g/ L), chloride ions (50-100 ppm), polyethylene glycol (40-2000 ppm), and 
2-aminobenzothiazole (10-100 ppm). The films were deposited under gal- 
vanostatic control. In Cu electropolishing, the electrolyte was phosphoric 
acid (H3;PO,) and the films were polished under potentiostatic control. 
Potentiodynamic (PD) polarization measurement was performed on an 
EG&G potentiostat/galvanostat (model 273A) with a Pentium PC. In these 
analytical experiments, the counterelectrode was platinum and the work- 
ing electrode was Cu with a constant surface area of 0.5 cm?. All potentials 
are reported relative to the Ag/AgCl electrode, which was used as the 
reference electrode. Cross-sectional profiles of Cu films were examined 
using a field emission scanning electron microscope (FESEM). Surface 
roughness was measured using an atomic force microscope (AFM). The 
sheet resistance of Cu deposits was measured by the four-point probe 
technique, and the resistivity measurements were carried out immedi- 
ately after deposition. 

Cu planarization process using ECP of Cu by electropolishing followed 
by CMP is depicted in Figure 4.18. After Cu electroplating completely fills 
the trenches and vias, electropolishing planarizes the surface down to the 
barrier layer, and the remaining Cu and the barrier metal are removed 
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Cu Electroplating FIGURE 4.18 Basic process of Cu electrochemi- 
cal planarization technology. 


by a single-step CMP process. Figure 4.19 shows the scanning electron 
microscope (SEM) cross-sectional profile of a 10-um Cu line planarized 
by the ECP process. In this case, the defect-free filling was obtained by 
an acid cupric sulfate electrolyte containing chloride (Cl), polyethylene 
glycol (PEG), and 2-aminobenzothiazole (2ABT). Subsequently, a clean 
and scratch-free surface was produced by electropolishing using HPO, 
as the electrolyte. Finally, CMP removed the remaining Cu and the bar- 
rier metal. The final-step CMP process used the H,O,-based slurry, which 
has a higher removal rate of TaN than that of Cu. For Cu electroplating, 
the combined action of CI-PEG-2ABT provided an inhibition gradi- 
ent between the opening and the bottom of a feature to obtain an obvi- 
ously selective deposition and to result in bottom-up filling. Tafel plots in 
Figure 4.20 reveal that the added 2ABT could enhance the charge transfer 
resistance to inhibit Cu deposition. The shifted overpotential, caused by 
the added PEG, was 61.7 mV relative to that of standard solution and the 
corresponding value for the combined action of PEG-2ABT was 77.2 mV. 


Copper CMP 
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FIGURE 4.19 SEM cross-sectional profile of a 10-um Cu line planarized by the ECP 
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FIGURE 4.20 Tafel plots of Cu electroplating with various electrolytes. 


The potentiodynamic curve of Cu electropolishing in the H,PO, (85%) 
solution is shown in Figure 4.21. For anodic potentials in the AB range, 
the metal surface became active. When the anodic potential is higher than 
the B point to the BC range, a viscous sublayer may start to form on the 
anodic surface. In the CD range, called the plateau region, a wide passiva- 
tion range existed; electropolishing occurred with negligible change in 
current density as the applied voltage increased. In this plateau operation 
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FIGURE 4.21 Potentiodynamic curve of Cu electropolishing. In addition, the optical micro- 
scope images (200x) show surface morphologies of Cu electropolished at different applied 
voltages. 


region, a nearly constant current maintained a stable electropolishing 
process. Unlike the cathodic-limiting-current plateau in electrodeposi- 
tion, which results from the depletion of metal ions in the diffusion layer 
near the electrode surface, the formation of anodic-limiting-current pla- 
teaus in electropolishing may be due to the presence of a viscous film on 
the anodic surface or the concentration barrier formed by accumulated 
dissolving metal ions. 

In the DE range of Figure 4.21, the electropolishing process occurred 
quickly because of the high applied voltage. An increase in current in this 
stage increased the rate of oxygen formation from the breakdown of water 
in the electrolyte with increasing potential. This oxygen formation reac- 
tion caused severe etch pits to be formed on the Cu surface. Optical micro- 
scope images in Figure 4.21 show that at applied voltages lower than point 
C, the polished surface was slightly etched; when the potential was in 
the plateau region (in the CD region), a smoother and scratch-free surface 
was obtained. Furthermore, when a high potential was applied (in the DE 
region), a greater amount of oxygen bubbles was generated and the work- 
ing electrode surface was pitted. 

In the process of Cu electropolishing, the polishing rate was constant 
and determined by measuring the remaining Cu thickness of the blanket 
wafer with a 1-um-thick Cu film. The polishing rates of electroplated Cu 
films were about 500 nm/min, 1000 nm/min, and 1500 nm/min for 85% 
(vol.), 70% (vol.), and 50% (vol.) of HPO, electrolytes, respectively, as calcu- 
lated from the data in Figure 4.22. The fluctuation of polishing rates away 
from linear fitting may be due to non-uniform current distribution on the 
residual Cu film. After electropolishing, the average roughness (Ra) of the 
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FIGURE 4.22 Effects of acid concentration on polishing rates and surface morphologies 


(AFM images) for Cu electropolishing at applied voltage of 1.3 V (with respect to the Ag/ 
AgCI reference electrode). 


films decreased with increasing acid concentration, as seen in Figure 4.22. 
Therefore, electropolishing is normally carried out in a limiting-current 
region and in a concentrated acid electrolyte; a higher acid concentration 
showed a higher leveling effect. 

Generally, conventional Cu electroplating will produce step on/between 
features. The traditional Cu CMP process includes step-height reduction 
of wiring metal and removal of overburden metal outside the features. 
Steigerwald et al. found that Cu dishing is a strong function of linewidth, 
but is only weakly dependent upon pattern density. At the same pattern 
density, the amount of dishing increased as the linewidth increased. In 
this study, we also encountered the same issue for Cu electropolishing. To 
measure the planarization efficiency (PE) of the CMP process, Steigerwald 
et al. defined the following equation: 


PE = [1 —(Adown/Aup)] *100% (4.3) 


where down and up are the thickness differences of the inside and out- 
side of the feature respectively, as shown in Figure 4.23. In this article, 
we also applied Equation 4.3 is also applied to monitor PE of Cu elec- 
tropolishing. A better planarization ability is noted when the PE value is 
higher. Ideally, PE is equal to 1. The following mechanism of electropol- 
ishing is suggested: the microleveling effect occurs because of selective 
dissolution. When current is applied, a passivation film covering the 
crevices of the surface—which has a high specific gravity, viscosity, and 
insulation—prevents dissolution; whereas the surface protuberances not 
covered by the passivation film—which receive greater current from the 
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FIGURE 4.23 Diagram of PE measurements, where Adown and Aup are the thickness dif- 
ferences inside and outside of the feature, respectively. 
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FIGURE 4.24 Evolution of PE values with (a) different linewidth and (b) different ratio of 
linewidth to space width. 


cathode—dissolve more quickly. This phenomenon leads to a lower dis- 
solution gradient in larger features such as in a blanket surface. Therefore, 
Figure 4.24a shows that PE decreased with increasing linewidths when 
pattern density [line/(line + space)] remained at 50%. By sputtering Cu 
into the filled features, starting profiles of filled features with greater step 
height manifest the pattern effect on planarization efficiency. 

The SEM images in Figure 4.25 indicate that the capability of step-height 
reduction in small patterns was higher than that in larger patterns. In 
these cases, the polishing time was 150 s. Table 4.1 summarizes the start- 
ing and final (after electropolishing) step heights of different features. 
Moreover, PE decreased more quickly for narrower lines than for wider 
lines. In addition to the linewidth dependence of PE, Figure 4.24b also 
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TABLE 4.1 

Starting and Final (after Electropolishing) Step Heights of Different Features 
Line/Space (um) Starting Step Height (nm) Final Step Height (nm) 

1/1 -850 -480 

2/2 ~730 ~420 

5/5 ~720 ~530 

10/10 ~720 ~550 

20/20 ~720 ~550 

1/4 ~1300 ~700 

1/2 ~1300 ~640 

2/1 ~850 ~360 

4/1 ~760 ~260 


shows that PE decreased with an increase in space width when linewidth 
remained unchanged. We suggest that dissolution current density in a 
smaller space was higher than that in a larger space, thereby resulting 
in a higher polishing rate around the outside of the feature or leading to 
an enhancement of the dissolution gradient between the gap and spac- 
ing. However, the influence of space width on PE was lower than that of 
linewidth, as shown in Figure 4.24b. 

In these Cu electroplating experiments, only PEG and inhibitor were 
used, so there was no overplating but dishing occurred with about a 100 
nm step height in a 10-um Cu line. Cu electropolishing was capable of 
eliminating the step height of such an electroplated Cu line, as shown in 
Figure 4.19. However, overplating has recently been observed for a con- 
ventional bath with a brightener. Nevertheless, Cu electropolishing is still 
able to yield a planar surface due to a higher polishing rate for overplated 
protrusions with higher current density than that for blanket regions 
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FIGURE 4.25 SEM cross-sectional profiles. 
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FIGURE 4.26 Average roughness (Ra) and resistivity of blanket Cu films with various pol- 
ishing times. 


with lower current density. On the other hand, non-uniform current dis- 
tribution is a challenge in the case of Cu electropolishing for global pla- 
narization of an entire wafer. Adding additives into polishing baths or 
introducing a multistep pulse current could improve the global unifor- 
mity within a wafer. 

Figure 4.26 reveals that the average roughness (Ra) of blanket Cu 
films decreased as polishing time increased. At the beginning of elec- 
tropolishing, the microleveling effect was more obvious than at later 
times. The trend in this figure mainly followed from the fact that a 
point-discharge effect is more efficient for a rough surface than for a 
polished surface. After electropolishing for 130 s, Ra was approximately 
1.1 nm, as compared to 13.2 nm before electropolishing. The resistiv- 
ity of polished films was not obviously changed, as also shown in 
Figure 4.26. This result suggests that no H¿PO, electrolyte diffused into 
grain boundaries of Cu films, whereas some oxidants often cause such 
damage after CMP processes, thus degrading the electrical characteris- 
tics of polished Cu metals. 


4.3.3 Cu Pad Size and Linewidth Affect Dishing 


Although the exposed Cu can immediately react with oxygen to form an 
oxide film, the film is porous and not of a self-protective nature. Therefore, 
a capping material, such as SiN, is necessary to prevent the corrosion of 
Cu. Unfortunately, the Cu corrosion depends on the delay time from the 
CMP polish of Cu to deposition of the protective layer. On the production 
line, the manufacture available time and efficiency are very hard to reach 
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these stiff conditions. Hence, the prevention of Cu corrosion becomes the 
major challenge in production to improve the yield. On the other hand, 
the grains of Cu will enlarge during subsequent thermal processing. For 
instance, the heating during dielectric material deposition would induce 
the Cu grain growth and generate the voids in Cu due to the surface area 
diminution of grain boundary. This will result in poor thermal stabil- 
ity of Cu film. In the Cu CMP process, dishing and erosion are the other 
problems for the Cu line thickness control. The occurrence of dishing and 
erosion depends on the line width and density. Besides, the CMP param- 
eters such as polishing down force, polish head rotation speed, polish pad 
elastic properties, slurry flow, polish time, and so forth affect dishing and 
erosion. Capping with a protective layer is the plausible way to avoid the 
dishing and erosion phenomena so as to improve the Cu thermal stabil- 
ity. This section investigated the dishing and erosion phenomena of Cu 
in the CMP process. The tantalum nitride (TaN) capping on the top of Cu 
surface is proposed to protect the Cu from corrosion and oxidation. The 
thermal property of Cu is also examined using stress migration to evalu- 
ate its stability. 

A three-metal-level Cu interconnect was performed. In metal 1, a single 
damascene structure was applied. Dual damascene structure of via 1, 2 
and metal 2, 3 was applied. IMP sputtering of TaN was utilized for Cu 
barrier deposition and self-ionized plasma sputtering of Cu was applied 
for the seed layer deposition of electroplating Cu. Overburden Cu was 
polished in a linear system. A two-step polish (copper and barrier metal 
were separated polish) system was carried out for the Cu-CMP process. 
An aluminum oxide abrasive system was selected for both of the polish- 
ing steps. A TaN capping process was carried out after CMP polishing. 
The TaN of 30 nm in thickness was deposited and repolished away at 
the second step of Cu CMP (Figure 4.27). Because of the selectivity effect, 
there remained a very thin TaN layer on the Cu surface. The Tencor HRP- 
20 microprofiler measured dishing and erosion. The thermal stability of 
Cu was evaluated in a furnace at 180°C for 170 h. The resistance of metal, 
Rs, was measured based on a serpentine test structure and the resistance 
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FIGURE 4.27 TaN capping process procedure. 30 nm TaN was capped after Cu CMP, and 
after TaN repolishing there remains a very thin TaN layer on the Cu surface. 
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of via, Rc, was measured based on via chain (totally 1798 via embed- 
ded with 2 mm in length and 1.5 mm in width) structure. A KLA-Tencor 
AIT-II was used to evaluate the defects induced by Cu corrosion. 


4.3.3.1 Pattern Dependence of Dishing and Erosion Phenomena 


The influence of pattern density on dishing and erosion was character- 
ized and the results are shown in Figure 4.28 and Figure 4.29, respec- 
tively. Figure 4.28 illustrates that the Cu pad size and linewidth affect the 
dishing. As the Cu pad size increases four times, the dishing increases 
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FIGURE 4.28 The effect of Cu pad size and linewidth on dishing. 
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FIGURE 4.29 The effect of Cu linewidth on erosion at fixed oxide pitch of 120 um. 
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TABLE 4.2 
Selectivity of Different Cu Polish Modules 
Thermal 
CMP Polish Module Cu TaN FSG Ox 
1st module for Cu polish 115 4 0.7 1 
2nd module for TaN polish 1 5.8 3 2.3 


to approximately two times. The erosion exhibits the same trend; it 
increases with the linewidth at a fixed oxide spacing of 120 mm, as shown 
in Figure 4.29. 

In Figure 4.28, the dishing increases in a very fast rate at the small sizes 
of Cu pad area ranging from 60 to 150 mm?. In small Cu pads, the oxide 
plays a very important role in antidishing. The higher the ratio of oxide 
area, the lesser the dishing observed. This results from the high selectiv- 
ity of oxide in the Cu polishing module using the aluminum slurry sys- 
tem. The selectivity of different Cu CMP polishing modules is shown in 
Table 4.2. 

On the other hand, the percentage of Cu overpolish will worsen the 
result. As the percentages of overpolish increase, the total polish time 
increases. This, in turn, immerses the wafer in the slurry for longer 
times. The additional immersion time caused Cu corrosion by slurry 
chemical reaction. This explains why more overpolish causes a higher 
degree of dishing. For the Cu pads larger than 150 mm, the dishing 
increased at a nearly constant rate. In these cases, the CMP polish pad 
deformation dominated because the CMP polish pad is made of poly- 
urethane that would be deformed during polishing and provide the 
CMP planarization. The limitation of polish pads deformation causes 
the constant increasing of dishing. As shown in Figure 4.29, the erosion 
also increases at a constant rate. Fixed densities of Cu lines and oxide 
pitch (120 mm) are the major cause of this phenomenon. Because the 
selectivity of Cu to oxide is approximately 200:1 (Table 4.2), the larger 
oxide area will reduce the erosion amount in small line width. The 
increase of erosion only depends on the Cu linewidth at fixed oxide 
pitch. Besides, the higher overpolish exhibited the same behavior, as 
observed in dishing experiments; the slurry chemistry effect is again 
dominant. 


4.3.3.2 TaN Cap Process for Cu Corrosion Prevention 
and Thermal Stability Improvement 


The dishing and erosion generated by the Cu CMP process could be con- 
trolled by an appropriate Cu-to-oxide-area ratio. After Cu polishing, an 
IMP-IaN layer was sputtered onto the wafer surface. The overburden TaN 
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FIGURE 4.30 (a) The cross-sectional view of the Cu surface capped with a very thin layer of 
TaN. (b) The EDX analysis of the circled area of (a). 
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FIGURE 4.31 The Rs of specimens with TaN and without TaN capping at 0.19 um in width 
and 1400 um in length. 


above the Cu and dielectric was polished away by a TaN module and a 
cross-sectional view of such a composite structure is given in Figure 4.30a. 
The analysis of a capping layer using electron dispersive spectroscopy 
(EDX) is shown in Figure 4.30b. After TaN capping, Cu was isolated from 
oxidative ambient and its corrosion was thereby prevented. However, the 
TaN capping raised the resistance of Cu (Rs). Because the composite struc- 
ture contains a high resistivity TaN layer (100 times higher than the Cu), the 
Rs of the Cu-containing TaN cap provides a 7.9% increment on resistance 
than that without TaN cap, as shown in Figure 4.31. On the other hand, the 
uniformity of Rs of composite metal is also lower than that without TaN 
capping. Furthermore, the Cu CMP repolish would worsen the non-unifor- 
mity of Rs. It is well known that Cu reacts easier with oxygen. The oxidation 
is a continuous reaction due to porous nature of Cu oxide and raises the 
resistivity of the Cu. The more Cu oxidation occurred, the less speed gain 
from the material changing from Al to Cu. In addition, the process reli- 
ability as well as the lifetime of products will shorten. The corrosion defect 
characterizations shown in Figure 4.32 and Figure 4.33 reveal that the cor- 
rosion defects dramatically increase in the specimens without TaN cap- 
ping. As for the specimens capped with TaN, the defect level remained the 
same up to 128-h heating treatment at 180°C. This observation evidenced 
that TaN capping could effectively isolate the Cu to prevent corrosion in an 
ambient environment. Thermal stability is another important issue for the 
utilization of Cu interconnection. Thermal stability of Cu was evaluated by 
the via resistance shift and the result is shown in Figure 4.34. After baking 
for 170 h in a furnace, the specimens capped with TaN exhibited a better 
thermal stability, as indicated by the Rc shift percentage characterization. 
During the following thermal process, grain growth of Cu occurred and 
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FIGURE 4.32 The relationship between waiting time and defect counts scanned by KLA- 
AIT-IL The defect counts increase with the waiting time. 
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FIGURE 4.33 (a) The corrosion defects observed after annealing for 8 h at 180°C. (b) The 
corrosion defects observed after annealing for 128 h at 180°C. 


the voids appeared. The interface of via connecting to prelayer metal is the 
preferential site of void formation. Poor adhesion between oxide and metal 
interface was observed at the bottom of via sidewall (i.e., the shrank), as 
shown in Figure 4.35. These voids deteriorated the thermal stability after 
high-temperature baking. The TaN capping is able to enhance the thermal 
stability because it restricts the Cu surface from reacting with the oxidative 
ambient and provides a good adhesion on the next Cu barrier layer, which 
is also of TaN. In addition, the TaN cap restricts the Cu line and inhibits 
its expansion during subsequent dielectric deposition. The restriction pro- 
vides a stable volume of Cu during further thermal processes and hence 
leads to a higher thermal stability of Cu. 
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FIGURE 4.34 The shift of Rc after 180°C baking for 170 h. 
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FIGURE 4.35 The via bottom (i.e, the shrank) exhibited a poor adhesion after heat 
treatment. 


The amount of Cu dishing and erosion after Cu CMP was found to 
increase with the increases of Cu pad size and overpolish time. By cap- 
ping a thin TaN layer on Cu, the Cu surface was effectively isolated from 
the oxidative ambient and the corrosion is presented. Furthermore, there 
is no increase of defect density in the specimens incorporating with the 
TaN capping process. The TaN capping process also provided a better 
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thermal stability of Cu during subsequent thermal treatment. The only 
flaw of TaN capping is the increase of Rs, which will deteriorate the opera- 
tion speed of devices. However, it could be overcome by design optimiza- 
tion of the circuits. 


5 


Nanotopography 


5.1 What Is Nanotopography? 


The industry definition of nanotopography covers a spatial wavelength 
range of approximately 0.2 to 20 mm. This region essentially defines sur- 
face topography dimensions larger than roughness. Roughness is defined 
as the smaller size spatial features on the wafer (an analogy to grains of 
sand in the desert), and nanotopology defines wider features up to approx- 
imately a lithographic field site. Nanotopography defines the nanometer- 
scale height variations that exist on lateral millimeter-scale wavelength on 
an unpatterned silicon wafer. The characteristics of the variation depend 
on the specific wafer manufacturing; process used to generate a particular 
wafer. An illustration of wafer nanotopography is shown in Figure 5.1. The 
height variation is typically 100 nm with a lateral length scale between 1 
and 20 mm. This is a parameter that measures the front-surface, free-state 
topology of an area that can range in size from fractions of a millimeter to 
tens of millimeters. 

Nanotopography is the surface topography of wafers placed on a flat 
stage without chucking or clamping. It has been known as the waviness 
visually represented with an optical tool called a magic mirror, which 
provides images that are qualitative pictures of topography variations 
on the frontside surface of wafers. It has a peak-to-valley height that is 
considered to vary between several nanometers and several hundred 
nanometers, and its spatial wavelength range is considered approximately 
up to 20 mm. 

In typical CMP machines, a front-side surface reference is employed, 
and the backside surface of a wafer touches soft carrier films or airbags, 
both of which absorb the topography variations of the backside surface. 
Thereby, wafers are in a condition without chucking or clamping in the 
CMP machines. Moreover, nanotopography differs from front-referenced 
site flatness in that for nanotopography the wafer is measured in a free 
state, while for flatness it is referenced to a flat chuck. A wafer may have 
perfect flatness (in the classical definition of flatness) yet still have nano- 
topography. If a wafer has surface irregularities on the front and backside 
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FIGURE 5.1 (See color insert) Illustration of wafer nanotopography. 


of the wafer, but front and back surfaces are parallel, the wafer has per- 
fect flatness. However, the same wafer will exhibit nanotopography 
(Figure 5.2). 

Nanotopography bridges the gap between roughness and flatness in 
the topology map of wafer surface irregularities in spatial frequency 
(Figure 5.3). As linewidth shrinks with non-uniform pattern density and 
with the use of hard pads for CMP, nanotopography may significantly 
degrade the dielectric film uniformity. 

Nanotopography is measured by two techniques: light scattering and 
interferometry. Light scattering tools typically employed for particle and 
surface-defect characterization can be used to measure the local slope 
change over the entire surface of the wafer. The local slope change may 
be integrated to yield height or topography information. Since the beam 
size can be on the order of fractions of a micron, nanotopography can 
be measured. Optical interference measurement is straightforward: A 
beam is split into two components—one component is reflected from the 
wafer surface and the second is reflected from a reference mirror. The 
interference of the combination of the two beams is a measurement of the 
topology of the wafer surface. With both techniques, signal filtering is 
used to separate the low-wavelength features (i.e., warp) so that only the 
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FIGURE 5.2 Nanotopography variations on the frontside and backside condition. 
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FIGURE 5.3 (See color insert) Topology map of wafer surface. 


high-wavelength/low-frequency information, (i.e. the true surface nano- 
topography) is measured. The equipment used in measuring the nano- 
topography of the wafer will be introduced in Section 5.4. 


a! 
5.2 Why Nanotopography Is Important 


Recently, the nanotopography of the surface of silicon wafers has become 
an important issue because it may seriously affect the post-CMP unifor- 
mity of thickness variation of dielectrics. 

Semiconductor device fabrication on silicon wafers comprises steps at 
which layers are deposited, subsequently planarized, and structured. 
Planarization is typically performed using a polishing step where the 
smoothing of the layer is due to chemical interaction with the polishing 
slurry as well as mechanical abrasion. Therefore, such processes are called 
chemical-mechanical polishing (CMP) and they have been implemented, 
for example, in silicon wafer manufacturing for more than 30 years. The 
homogeneity of a post-CMP layer is limited by fluctuations of the combi- 
nation of layer deposition and CMP, as well as the frontside topography of 
the substrate. These two contributions to post-CMP layer thickness devia- 
tions of oxide film need to be quantified properly for identifying poten- 
tials that allow improvement of the efficiency of planarization for future 
devices. This is particularly necessary since excessively large-layer thick- 
ness variations after CMP may have a negative impact on device perfor- 
mance such as leakage and the pinhole effect, and the thickness of layers 
might even decrease with the ongoing reduction of critical dimensions. 
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Concern over local wafer site flatness for advanced lithography has led 
to new wafer surface topography requirements. These requirements are 
typically driven by CMP where film thickness variations can result in 
uneven surface topography. Control over local wafer site flatness becomes 
important for device geometries smaller than 0.25 um. 


nal 
5.3 Impact of Nanotopography on CMP 
5.3.1 General Introduction 


The interaction of nanotopography upon film polishing uniformity in 
CMP has been under extensive investigation by Boning and co-workers 
and Tamura et al. (2000). The primary effect of oxide uniformity removal 
is due to the hardness of the CMP pad. The fundamental concept is very 
simple: soft polishing pads conform to local topology variations (i.e., nan- 
otopography), whereas hard pads do not. Figure 5.4, adapted from Boning 
et al. (2000), illustrates this principle. Typically, a wafer has a characteristic 
nanotopography length (NL, shown in the top illustration of Figure 5.4). 
The soft pad will conform over the nanotopography and maintain a uni- 
form film. The hard pad will not conform to the nanotopography and 
produce a non-uniform film with high spots on the wafer surface having 
a thinner film and low spots having a thicker film. Traditionally soft pads 
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FIGURE 5.4 (See color insert) Basic concepts of soft and hard polishing pads. 
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have been used for film polishing in CMP. However, with the need for bet- 
ter planarization because of more layers, smaller critical dimension, and 
for multifunction logic devices that have several different areas of vary- 
ing pattern densities, stiff pads are required. To some extent, the effect of 
nanotopography can be minimized by using polishing additives, such as 
ceria particles. Nonetheless, wafer nanotopography becomes increasingly 
important. 

To understand the influence of nanotopography in CMP film polishing 
uniformity, the concept of planarization length should be considered. The 
planarization length (PL; Figure 5.5) is the spatial length at which polish- 
ing cannot reduce the step height of a feature in the film thickness, as 
shown in Figure 5.5. The important aspect to consider is when PL is less 
than NL, the film uniformity is maintained; however, when PL is more 
than NL one could find non-uniform film polishing. Two typical exam- 
ples are shown in Figure 5.4. The CMP process and the film uniformity 
specifications may be considered to determine the level of nanotopogra- 
phy required. 

Nanotopography of the silicon wafer is dictated to a large extent by the 
polishing process. For single-sided polished (SSP) wafers, the polishing 
process has been optimized to minimize nanotopography. In this pro- 
cess, to achieve good flatness, the wafer must be mounted or chucked 
against a flat reference block. Since the wafer backside is etched (not 
polished smooth), it has surface topology. Because of the fixing process 
used to mount the wafers (e.g., wax mounting or vacuum chucking), the 
topology of the backside of the wafer and the fixing surface or adhesive/ 
wax are transmitted to the front side and causes nanotopography. The 
other technique of mounting a wafer (the one that is normally used in 
CMP), namely, free mounting, does not cause nanotopography formation, 
but also does not guarantee the wafer is made flat. The best flatness and 
nanotopography is obtained when the wafers are double-sided polished 
(DSP). The true, planetary, free-floating DSP process polishes both sides of 
a silicon wafer simultaneously. Since the wafer is polished in a free state, 
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FIGURE 5.5 Basic concepts of soft (left) and hard (right) polishing pads. 
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FIGURE 5.6 (See color insert) A comparison of SSP and DSP mounting techniques and how 
these affect nanotopography and flatness. 


nanotopography is minimized. Also, good flatness is achieved. Thus, 
both good flatness and nanotopography are produced. Figure 5.6 shows 
a comparison of SSP and DSP mounting techniques and how these affect 
nanotopography and flatness. 

Wafer manufacturing developed a planetary DSP process that provides 
wafers with leading-edge nanotopography and flatness characteristics 
that meet all of the increasing demands of CMP requirements. Although 
planetary DSP is technically the best method to achieve superior nano- 
topography and flatness, there are several barriers to practically applying 
this method in the fab. These barriers include cost of ownership for DSP, 
issues related to running both a polished backside DSP and an etched 
backside wafer in their lines at the same time, electrostatic chuck prob- 
lems, and in-line sensor calibration. Wafer manufacturing companies are 
actively exploring both planetary DSP and SSP methods that promise to 
achieve a good balance between nanotopography and flatness results and 
cost of ownership. 


5.3.2 Spectral Analysis of the Impact of Nanotopography 
on Oxide CMP and Fourier Transform Method 


In general, it is quite convenient to apply spectral analysis to research the 
impact of nanotopography on film thickness variation in CMP. The sur- 
face height changed as the nanotopography of wafers were measured by 
an ADE NanoMapper. Figure 5.7 shows an example of nanotopography 
of a wafer measured by ADE NanoMapper. The darker region in the map 
corresponds to a lower height and the brighter region to a higher height. 
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FIGURE 5.7 (See color insert) Nanotopography map for a wafer. 
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FIGURE 5.8 Power spectral densities of nanotopography and pre-/post-CMP film thick- 
ness variation with different pads. 


Figure 5.8 shows the power spectral densities (PSDs) of nanotopography 
and pre-/post-CMP film thickness variation. The higher the PSD num- 
ber, the higher nanotopography in the surface of the wafer. To directly 
relate the nanotopography and the film thickness variation after CMP, we 
introduce a theory conducted by the professor J.G. Park, who used Fourier 
transform function to convert PSDs to a more understandable parameter 
called transfer function T(p, t). 

Park and co-workers (2001) proposed a formula that describes the rela- 
tionship between the nanotopography and the film thickness variations 
after CMP. The discussion is based on the concept of the planarization 
length of CMB which is mentioned in Section of 5.3.1. Park et al. only deal 
with blanket (unpatterned, one material) wafers and the uniformity of the 
local polishing rate causing the fluctuations of surface height. The follow- 
ing formula is for one dimension: 


h(x,t)=w(x)+ f(x,t) (5.1) 
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The height of the oxide surface at any time during polishing can be denoted 
as: x is position, t is time, h(x, t) is the height of the oxide surface, w(x) is 
the nanotopography of the silicon surface, and f(x, t) is the film thickness 
of the oxide. To simplify the formulation, the origin of the height axes for 
h(x, t), w(x), and f(x, t) can be arbitrary changed. 

The raised regions on the surface may cause the dynamical excess defor- 
mation of the polishing pad incurred during polishing. The deformation 
is propagated within a certain lateral range and it is a nonlocal phenom- 
enon. It can be described by the spatial-convolution function in the real 
domain. The excess deformation of the polishing pad may affect the local 
removal rate. Supposing its effect is the linear correlation, the local polish- 
ing rate can be given by: 


Oh(x,t) _ —(K, +K, -CMP @h(x,t)) 
= 6.2) 


where K, is the blanket polishing rate without considering nanotopogra- 
phy, K; is the coefficient for the local excess-polishing rate caused by sur- 
face-height fluctuation, CMP, (r) is the response function (r is the lateral 
distance from the origin) describing the nonlocal pad deformation and its 
affect on the polishing rate. The function CMP, (r) is an even function and 
closely related to polishing parameters, such as the polishing-pad hard- 
ness, relative velocity between pad and wafer, polishing pressure, and so 
forth, through the planarization length that characterizes the lateral range 
over which the raised topography interacts. After integration of Equation 
5.2 for time, h(x, t) is expressed as: 


Hen=M0)-] r+ K, if “CMP, ent 
i (5.3) 


The unknown function h(x, £) is too complicated to solve analytically, so 
a simplification is applied below. 

When t =0, if the film thickness f(x,0) is uniform enough for our analy- 
sis, the initial surface-height fluctuation can be denoted as follows using 
Equation 5.1 (Note: the origin of the height axis is variable). 


h(x,0)= w(x) (5.4) 


The spatial profile of h(x, t) is continuously changing as the polishing 
is performed. However, supposing that the h(x, t) can be described as the 
convolution of h(x, 0) (as the initial profile) and another time-dependent 
proper response function CMP, (r, t), the equation is given by: 
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CMP, Qh(x,t)= CMP,(t) @w(x) 


(5.5) 
Then, Equation 5.3 can be rewritten as: 
t 
h(x,t)=w(x)— x, f CMP, (t) @w(x)dt 
0 
t 

=w(x)— K f CMP, (t)dt) |Sw(x) (5.6) 

0 


=w(x)— CMP(t) @w(x) 


where CMP (r, t) is a time-dependent spatial-convolution function, and 
the term of K,in Equation 5.3 is omitted because it is x-independent. After 
Equations 5.1 and 5.6, the film thickness f(x, f) can be expressed as: 


f (x,t) = —CMP(t) @w(x) (5.7) 


Next, the Fourier transformation of Equation 5.7 is denoted as: 


F(),t)=—FTCMP(),t)- WO) (5.8) 


where i is the spatial wavelength, F(A, t), WA), and FTCMP(t) are the 
Fourier transforms of f(x, t), w(x), and CMP(t), respectively. After Equation 
5.8, the relationship between the power spectral densities of film thick 
ness variation and the wafer nanotopography can be expressed as: 


FO ron wo) (5.9) 


where T(A, t) is |FTCMP(, t)|? , and it is a time-dependent response func- 
tion and should still have a characteristics length originating the pla- 
narization length, which depends on the CMP parameters mentioned 
earlier. This response function T(), t) can be regarded as the transfer func- 
tion from the nanotopography spectrum to the film thickness variation 
spectrum. That is to say, if this transfer function has a large value ina cer- 
tain wavelength region, the component of the nanotopography impacts 
severely on the film thickness variation after CMP. 

Figure 5.9 is an example of two calculated transfer functions with wave- 
length. As mentioned earlier, the larger value of transfer function in cer- 
tain wavelengths means the stronger impact of nanotopography on film 
thickness variation. 
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FIGURE 5.9 Transfer function as the ratio of the oxide film thickness variation to that of 
nanotopography. 


The more detailed analysis of the Fourier transform function of Equation 
5.9 and the correlation between T(A, t) and the nanotopography impact on 
the film thickness variation with different pad hardness will be discussed 
in the next section. 


5.3.3 Impact of Nanotopography on Silicon Wafer on Oxide CMP 


5.3.3.1 Wafering Method Dependency of Impact of 
Nanotopography on Oxide CMP 


In addition to CMP process optimization, better silicon wafers are required 
to solve the nanotopography problem. In general wafer manufacturing 
(wafering), chemical etching is applied to remove the mechanical damage 
induced during the preceding lapping process. This is followed by a pol- 
ishing process to improve the parallelism of the wafer and create a very 
flat surface. The polishing process applied during wafering can be either 
DSP or SSP. For 8-inch wafers, device manufacturers generally require 
SSP because the frontside of the wafer is then distinct from the backside. 
To improve the nanotopography of SSP wafers, a combined chemical etch- 
ing and SSP process that can be applied during wafering was developed, 
and the performance of the resulting wafers for oxide CMP was examined 
by the analysis method described earlier. 

Three chemical etching methods were applied: acid etching, alkali etch- 
ing, and multietching (acid and alkali), which allowed us to obtain the 
benefits of both acid and alkali etching in a compatible process. To pre- 
pare the wafers to be tested, we used two polishing methods: conventional 
single-side polishing with wax mounting (SSP1) and improved single-side 
polishing (SSP2), in which the backside topography has little effect on 
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the polishing of the front surface. Combining the three etching methods 
and the two polishing methods yielded six types of sample wafers. In 
this book, we will use a notation based on the letters and numbers given 
above to distinguish between wafer types, thatis, Al refers to a wafer that 
underwent acid chemical etching and wax mounted polishing (SSP). 

Professor J.G. Park has produced a series of experiments to understand 
the effect of the different etching method and the different polishing 
method on film thickness variation after the CMP process. Figure 5.10 
shows the height maps and line profiles of the six wafer types. Table 5.1 
shows the parameters in this experiment of oxide CMP. 


Height (A) 


Height (A) 


Height (A) 


FIGURE 5.10 Nanotopography maps of wafers prepared using each process combination. 
Darker regions of the map correspond to a lower height and brighter regions to a greater 
height. The height profiles along the x-axis are shown. 
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TABLE 5.1 
Standard Deviation of Height Profile (nm) 
SOM and NanoMapper and DynaSearch and 

NanoMapper DynaSearch SQM 
Measurement 1 23.29 17.20 29.57 
Measurement 2 13.00 13.21 18.23 
Measurement 3 17.85 16.53 26.55 
Measurement 4 12.15 11.57 19.95 
Measurement 5 14.40 14.62 21.20 


As shown in Figure 5.10, the Al wafer had the largest nanotopography, 
whereas the C2 wafer had the smallest. Comparing the polishing meth- 
ods, we observe that SSP2 resulted in smaller height variation than SSP1. 
Figure 5.11 shows examples of the correlation between nanotopographic 
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FIGURE 5.11 Examples of the correlation between the nanotopography and inverse film 
thickness profiles before and after CMP for (a) A1 wafer and (b) C2 wafer. 
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line profiles and film thickness variation before and after CMP. Figure 5.11a 
shows that for the Al wafer and Figure 5.11b shows the C2 wafer. The 
peak and valley positions of the nanotopography coincided well with the 
film thickness variation after CMP, particularly for the A1 wafer. It can 
be observed that even the small nanotopography of the C2 wafer slightly 
influenced the film thickness variation. We attributed the fluctuations in 
the film thickness after CMP to the nanotopography. Note that the film 
thickness profiles are inversely plotted, which means that excess thinning 
occurred due to a pressure concentration at each local peak position of the 
nanotopography. 

In Figure 5.12, the standard deviation of the nanotopography and the 
film thickness variation after CMP are summarized for the six wafer types. 
Each value was calculated from the filtered profiles. For all types of etching, 
particularly acid etching, SSP2 more effectively reduced the nanotopogra- 
phy impact on the post-CMP film thickness variation. Wax mounting was 
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FIGURE 5.12 Standard deviations (Rms) of (a) the nanotopography and (b) the filtered film 
thickness after CMP. 
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used in SSP1 to hold the backside of the wafer on the flat ceramic plates. 
The backside waviness due to the chemical etching caused excessive height 
variation on the front surface through the internal stress distribution 
within the wafer. After stock removal polishing, the front surface became 
smooth, but once the wafer was removed from the ceramic plate, the inter- 
nal stress was released and variation of the front surface height caused by 
the backside waviness appeared. This is probably the reason for the SSP1 
wafers showing more nanotopographic variation than the SSP2 wafers. 
Among the different etching methods, acid etching resulted in the greatest 
nanotopographic variation, whereas the combined acid and alkali etch- 
ing would account for the greater nanotopographic variation. Multietching 
can be optimized to reduce the waviness by balancing the acid and alkali 
factors. The variation that appears as waviness after the chemical etching 
leads to nanotopography due to the backside influence described above 
and /or residual waviness on the front side after polishing. 

The correlation between the standard deviations of the OTD (oxide 
thickness deviation) and the NH (nanotopography height) is plotted in 
Figure 5.13. The OTD is independent of the nanotopography before CMP. 
On the other hand, the OTD and NH after CMP show a clear, positive 
correlation. The slope of the fitting line for the post-CMP correlation in 
Figure 5.13 may vary, depending on the CMP process parameters. A small 
slope indicates that the nanotopography is less likely to have an impact on 
the effectiveness of the CMP process. 

We also calculated the PSDs of the NH and OTD. The PSDs for the A1 
and C2 wafers are shown in Figure 5.14 as examples. This analysis quanti- 
tatively demonstrates that the C2 (multietching, SSP2) wafer had less nan- 
otopography than the A1 wafer within the wavelength range from 2 to 100 
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FIGURE 5.13 Correlations between the Rms of the NH and that of the OTD. 
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FIGURE 5.14 Power spectral densities of (a) height changes and (b) film thickness varia- 
tions before and after CMP for the Al and C2 wafers. 
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mm (Figure 5.12a). Regarding the film thickness variation (Figure 5.12b), 
the spectra before CMP were independent of the wafer nanotopography. 
On the other hand, the PSD of the OTD increased after CMP in accor- 
dance with the wafer nanotopography, which clearly demonstrates that 
the C2 wafer had a more uniform film thickness than the A1 wafer within 
the wavelength range from 2 to 30 mm. 

A clear and positive correlation between the standard deviation of the 
nanotopographic profile and that of the film thickness after CMP is evi- 
dent. The PSD analysis of the CMP results quantitatively demonstrated 
the effect of the nanotopography of different wafers on the film thickness 
variation. Wafers prepared with the improved SSP technique had less film 
thickness variation after unpatterned oxide CMP than the wafers that 
underwent conventional SSP due to the impact of the nanotopography. 
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5.3.3.2 Slurry Characteristic Dependency of 
Impact of Nanotopography on Oxide CMP 


To quantitatively analyze the impact of nanotopography on post-CMP 
oxide film thickness, we have introduced a spatial spectral method, and 
have used it to examine the effect of the pad type, removal depth, wafer 
manufacturing technique, and polishing method. However, the role of the 
slurry in controlling the impact of nanotopography on STI CMP is not yet 
clear. In this section, we discuss how the concentration of surfactant and 
abrasive size affect the impact of nanotopography. 

Four kinds of slurry with different sizes of abrasives, denoted as A, B, C, 
and D, were prepared through a mechanical treatment. Figure 5.15 shows 
the correlation between the lateral profiles of the NH and postCMP OTD 
on a wafer for two surfactant concentrations. Figures 5.15a and b corre- 
spond to slurry A (largest abrasives) with surfactant concentrations of 
0 and 0.80 wt%, respectively. Figure 5.15c and d correspond to slurry D 
(smallest abrasives), also with surfactant concentrations of 0 and 0.80 wt%, 
respectively. For each of the various surfactant concentrations, the peak 
and valley positions of the NH and post-CMP OTD coincide well with 
each other. Therefore, the fluctuations in the post-CMP OTD can be attrib- 
uted to the wafer nanotopography. Whereas the magnitude of the OTD for 
slurry A is similar with or without surfactant, that for slurry D increased 
with the surfactant concentration. 
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FIGURE 5.15 Correlation between the NH and the post-CMP oxide thickness variation: (a) 
Slurry A, 0 wt%; (b) Slurry A, 0.8 wt%; (c) Slurry B, 0 wt%; (d) Slurry B, 0.8 wt%. 


Nanotopography 127 


Rms of Post-CMP OTD (A) 


FIGURE 5.16 Standard deviation (Rms) of the post-CMP oxide thickness deviation. 


The correlation between the abrasive size, the surfactant concentration, 
and the standard deviation of the OTD is shown in Figure 5.16. For slur- 
ries A and B, the standard deviation of the OTD is not influenced by the 
surfactant concentration. For slurry C, however, the standard deviation 
increases gradually with increasing surfactant concentration. For slurry D, 
it increases even more strongly with increasing surfactant concentration. 
Thus, when slurry with smaller abrasives is used, the magnitude of the OTD 
increases with increasing surfactant concentration. That is, the surfactant 
more strongly influences the nanotopography impact on OTD after CMP 
in the case of smaller abrasives. Though other factors, such as the selec- 
tivity of the removal rate between oxide and nitride films, must be taken 
into account when discussing the influence on the actual STI CMP process, 
these findings show that the nanotopography impact can be controlled by 
manipulating the slurry characteristics. Note that even if the ceria slurry 
generally used in STI CMP has high oxide-to-nitride removal selectivity, 
it basically avoids only excessive thinning of the nitride. It does not avoid 
incomplete clearing of the oxide as an influence on the nanotopography. 

The PSDs of the post-CMP OTD for three surfactant concentrations are 
shown in Figure 5.17. The PSD increased with increases in the surfactant 
concentration in the wavelength range up to 30 mm. This result suggests 
that the change in planarization efficiency (or nanotopography impact) as 
a result of adding the surfactant occurs even in a longer wavelength range 
up to around 30 mm. 

The mechanism for the impact of nanotopography on post-CMP OTD 
is directly related to the planarization of the oxide surface. That is, the 
local polishing rate of the protruding areas produced by nanotopography 
is greater than that of the valley areas, which causes excessive thinning 
of the oxide film at each nanotopography peak, as shown in Figure 5.18. 
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FIGURE 5.17 Power spectral densities of oxide thickness deviation after CMP. 


Boning and Lee (2002) simulated this nanotopography impact through a 
contact mechanics model of the local contact pressure of the polishing 
pad on the wafer surface. However, an analysis that considers only the 
polishing pad and the film surface topography cannot explain the results 
of this study, as the role of the surfactant in planarization must also be 
taken into account. Nojo et al. (1996) proposed a useful model that contrib- 
utes to our understanding of these results. They reported that adding a 
surfactant to ceria slurry can result in “self-stopping polishing,” in which 
the polishing rate drops automatically as planarization progresses. They 
attributed this to the protective layer formed on the surface of the film by 
the surfactant. We expanded this model to take account of the effect of 
the abrasive size, as shown in Figure 5.18a. The larger abrasives marked 
A and B in the figure can remove both protruding areas and valley areas, 
but the smaller abrasives marked C and D can remove only protruding 
areas due to the thicker surfactant layer when the surfactant is added to 
the slurry. Accordingly, for the smaller abrasives case (marked C and D), 
the surface of the post-CMP film in the slurry with surfactant becomes 
flatter than that in the slurry without surfactant. A flatter post-CMP film 
surface corresponds to a more severe impact of nanotopography on OTD, 
as shown in Figure 5.18b. As a result, this model can explain the results 
shown in Figure 5.16. 

In conclusion, the magnitude of film thickness variation after CMP 
was found to increase with the surfactant concentration in slurries with 
smaller abrasives but to be almost independent of the surfactant concen- 
tration in the slurry with the larger abrasive. This result can be explained 
with the model based on the passivation layer of the surfactant adsorbed 
on the oxide film surfaces during polishing. 
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FIGURE 5.18 Proposed model for the nanotopography impact: (a) dependence of surface 
planarization efficiency on the abrasive size and surfactant and (b) relationship between 
film surface flatness and oxide thickness deviation after CMP. 
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FIGURE 5.19 Correlation between simulated and experimental polysilicon thickness varia- 
tion reduced by wafer nanotopography. 
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FIGURE 5.20 PV of remaining polysilicon thickness after polysilicon CMP, depending on 
PV of wafer nanotopography in 10-mm scan. 


5.3.3.3 Effect of Wafer Nanotopography on Remaining 
Polysilicon Thickness Variation after Polysilicon CMP 


Figure 5.19 shows the centerline profile for the wafer nanotopography 
(top line profile) and that for the remaining polysilicon thickness varia- 
tion (middle profile line). The peak and valley positions of inverted 
thickness variation are well correlated with that of wafer nanotopogra- 
phy. This indicates that wafer nanotopography directly influences the 
thickness variation after polysilicon CMP. The bottom line in Figure 5.19 
shows the simulated remaining polysilicon thickness variation induced 
by wafer nanotopography, based on the wear-contact model. It is obvi- 
ous that the measured thickness variation (middle profile line) is well 
correlated with the simulated one (bottom line profile). This indicates 
that the polysilicon CMP mechanism using colloidal silica slurry follows 
Prestonian behavior rather than non-Prestonian behavior. Figure 5.20 
shows the peak-to-valley (PV) value along the 10-mm radial scan of the 
remaining polysilicon thickness variation after polysilicon CMP as a 
function of the PV value along the 10-mm radial scan of wafer nano- 
topography. The PV value after polysilicon CMP exponentially increases 
with that of wafer nanotopography. Again, the wafer nanotopography 
strongly affects the remaining polysilicon thickness variation after poly- 
silicon CMP, suggesting that the wafer nanotopography results in Vy 
variation for a NAND-flash memory-cell fabricated with the self-align- 
ment of polysilicon floating-gate via polysilicon CMP. This is because 
both the voltage coupling of the floating-gate and the floating-gate inter- 
ference in the NAND-flash memory-cell are determined by the height 
of the polysilicon floating-gate, which is influenced by the remaining 
polysilicon thickness variation after polysilicon CMP. 
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FIGURE 5.21 (See color insert) Schematic structures of a 63-nm NAND-flash memory- 
cell with different floating-gate heights, induced by the wafer nanotopography of 10-mm- 
diameter scanning: (a) 45-nm height (free of wafer nanotopography influence); (b) 30.1-nm 
height (at the top of wafer nanotopography influence); (c) 59.9-nm height (at the bottom of 
nanotopography influence). 


5.3.3.4 Effect of V, Variation of Wafer Nanotopography on Remaining 
Polysilicon Thickness Variation after Polysilicon CMP 


A 63-nm NAND-flash memory-cell has a chip-side size of approximately 
10 mm, as shown in Figure 5.21. We assumed that after polysilicon CMP 
the peak position of wafer nanotopography produces the smallest height 
of the polysilicon floating-gate (Figure 5.21b) and that the valley value of 
wafer nanotopography produced the largest height of polysilicon float- 
ing-gate (Figure 5.21c). To calculate the V; variation originating from the 
PV value of the remaining polysilicon thickness after polysilicon CMP 
induced by wafer nanotopography, we should extract the floating-gate 
voltage determined by the floating-gate voltage coupling ratio and the 
floating-gate interference effect. Figures 5.22a and b show a three-dimen- 
sional schematic drawing of NAND-flash memory-cells and top views of 
each cell. Here, it was assumed that memory cell No. 5 was programmed. 
The applied voltage on the control gate, Vg, at the selected word line was 
18 V (Vp), while that at the unselected word line was 9 V (Vass) for the 
63-nm cell. The selected word-line bias was 16 V (V,,,,,), and the unselected 
word-line bias was 8 V (V,,,,.). The applied voltage on the selected bit line 
was ground (GND), while that at the unselected bit line was 1.8 V (V,,). 
Therefore, the channel-regions bias (V,,) of unselected bit-line cells was 
floating. 
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FIGURE 5.22 (See color insert) Programming operation of a NAND-flash memory-cell, 
where the programming cell is No. 5: (a) voltage bias conditions for memory cells and (b) 
parasitic capacitances during No. 5 cell programming. 


The device dimension for the 45-nm cell was extracted from the dimen- 
sional trend for 90-, 70-, and 63-nm technology nodes. According to this 
trend, the height, length, and width of the floating-gate are 32, 45, and 57 
nm, respectively. For the 63-nm NAND cell, oxide-nitride-oxide (ONO) 
layer and tunneling oxide thicknesses (equivalent oxide thickness, EOT) 
were 14.5 and 6 nm; for the 45-nm one, they were 11.9 and 5.2 nm, respec- 
tively. Each of the parasitic capacitances was obtained using the equation 
C = ¢,e,A/d, where £, e, A, and d are the dielectric constant for insulating 
film, dielectric constant for vacuum, capacitor area, and insulation film 
thickness, respectively. Cupca-ures Cxros-xeco Cyrâo-urâsr ANd Cyros-yrcs are 
the parasitic capacitances between the floating-gates at cells 4 and 5, 5 
and 6, 2 and 5, and 5 and 8, respectively. C,rc2yces and Cyrcs-yces are the 
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parasitic capacitance between the floating-gate at cell 2 and control-gate 
at cell 5 and between the floating-gate at cell 5 and control-gate at cell 
8, respectively. The other parasitic capacitances were neglected because 
their values are compared with those of the above-mentioned six parasitic 
capacitances. Since the ONO capacitance at the programmed memory-cell 
(Cono) depending on the height of floating-gate is related to the voltage 
coupling ratio, the calculation of Cono was performed with a device simu- 
lator (ATLAS, Silvaco Corp.) Using the results of two-dimensional Poisson 
equation solving, Coyo was calculated by the formula 


Cono 
= V, ¿Coro = 1 
Cono F Crun E ve Veg E Vig > 0) 


FG 


where Vps and Cruy are the floating-gate voltage without considering the 
floating-gate interference effect and tunnel oxide capacitance. As shown 
in Figure 5.23, Cono for the 63- and 45-nm memory-cell linearly increases 
with increasing polysilicon floating-gate height determined by the PV of 
the remaining polysilicon thickness after polysilicon CMP. After the cal- 
culation of Cono, the floating-gate voltage, considering the floating-gate 
interference effect (Vpcs), was calculated by 


_ ConoVcas + Cxraa-xras Vraa + Ceres—xrc6V ras + Cyrez- yras Vra2 +C ras-yrasVrGs + Crece ( Vea + Vecs) 
Cruw + Cono + Cusca-aras + Curas-xpâs + Cupa yras ¡A + Crece 


FG5 


(5.11) 


where VrauVeras Vraz and Vrgs are the floating-gate voltage at cells 4, 6, 2, 
and 8, respectively, and Veg, and Vegg are the control-gate voltage at cells 
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FIGURE 5.23 Calculated dependence of Cono on polysilicon floating-gate height. 
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FIGURE 5.24 Dependence of floating-gate voltage on floating-gate height with program 
voltage biasing. 


2 and 8, respectively. For the 63-nm cell (63-nm gate length and 80-nm 
gate width), the distances between the floating-gates along the word line 
and bit line are 63 and 50 nm, respectively. For the 45-nm one (45-nm 
gate length and 57-nm gate width), they are 45 and 50 nm, respectively. 
Programming voltages were assumed to be 18 and 16 V for the 63- and 
45-nm cells, and the pulse duration was 200 psec. Figure 5.24 shows Vr; 
as a function of polysilicon floating-gate height determined by the PV of 
the remaining polysilicon thickness after polysilicon CMP. V;¢5 increases 
with polysilicon floating-gate height. The voltage coupling ratio for the 
63-nm cell (at 45-nm floating-gate height) is approximately 2.2% larger than 
that for 45-nm one (at 32-nm floating-gate height). This indicates that the 
floating-gate voltage coupling becomes weaker as memory size becomes 
smaller because of the reduction of floating-gate voltage coupling and the 
enhancement of floating-gate interference. 

Fowler-Nordheim tunneling-current density, well known as the pro- 
gramming mechanism of a NAND-flash memory-cell, is represented by 


q E? -42m (a) 


= exp 
161 ho, 3hqE 


(5.12) 


where fi is the conduction barrier height between SiO, and silicon sub- 
strate and m is the effective mass of electron. The size of the active area 
generating Fowler-Nordheim tunneling is W x L = 514 x 63 nm for the 
63-nm NAND and 36.7 x 45 nm for the 45-nm NAND. The number of elec- 
trons can be approximately obtained because the current density is con- 
verted to charge when the pulse duration is 200 psec, as assumed earlier. 
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FIGURE 5.25 Number of tunneling electrons versus floating-gate voltage and electric field 
on tunneling oxide at cell No. 5. 


The result is shown in Figure 5.25, where the x-axis is the voltage applied 
to the tunneling oxide and electric field, and the left y-axis is the number 
of tunneling electrons. Although, the number of tunneling electrons in 
the 63-nm NAND is larger than that of 45-nm NAND-flash memory-cell, 
V. variation of the latter is severe. This is because the tunneling oxide 
capacitance of the 63-nm NAND is 1.99 times bigger than that of 45-nm 
NAND, due to the large active silicon area. 

As Vz, is changed with floating-gate height, the variation of V, defined 
by AV, is represented by 


AQ = Qe 45+anm QFN 45-a a 


Coun snm 


AV > nm = 


(5.13) 


AQ (= Qew s2+anm — QEN 32-0 de) 


Crun 45nm 


AV > 45 = (5.14) 


where a is the height of the changed floating-gate. For instance, if the 
height of the floating-gate has the variation of 10 nm, then a is 5 nm. 
Therefore, AQry is Qensoum — Qrnaonm The variation of Vy classified by 
thickness, calculated earlier, is represented by the function of the PV of 
the polysilicon floating-gate height induced by wafer nanotopography, as 
shown in Figure 5.26. The final V; variation represents the difference in 
the variation between two cells that have different polysilicon floating-gate 
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FIGURE 5.26 Threshold voltage of NAND-flash cell shift due to film thickness variation. 
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heights. Thus, for both the 63- and 45-nm NAND-flash memory cells, Vy 
variation linearly increased with the PV of the polysilicon floating-gate 
height induced by wafer nanotopography. However, the V; variation for 
the 45-nm cell is higher. The PV of the polysilicon floating-gate height and 
the PV of the wafer nanotopography are related, as shown in Figure 5.21c. 
Accordingly, the Vy variation linearly increases with the PV of the wafer 
nanotopography, as shown in Figure 5.27. The impact of the Vy variation 
for the 45-nm NAND is higher than that for 63-nm. This shows that the 
larger the direct influence of the wafer nanotopography, the larger the Vy 
variation becomes. It is intuitive that the impact of the wafer nanotopogra- 
phy on the V; variation becomes larger as the memory-cell-device design 
rule (gate length) becomes smaller. Thus, optimizing silicon-wafer fabri- 
cation processes to reduce the PV of wafer nanotopography, and thereby 
minimize the V+, variation, is key to improving device yield. 


a SSe 


5.4 Equipment in Measuring the Nanotopography 


Three kinds of instruments for the characterization of nanotopography, 
namely, SOM, NanoMapper, and DynaSearch are reviewed and com- 
pared. The calibration result using identical samples is also shown. 


5.4.1 Introduction to General Equipment Used in 
the Measurement of Nanotopography 


How is nanotopography inspected and sorted? 
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FIGURE 5.27 NAND-flash cell threshold voltage shift resulting from wafer 
nanotopography. 
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FIGURE 5.28 Optical schematic of SOM. 


$) oT. 


Spot Spot 
Centered Offset 


FIGURE 5.29 Quantitative measurement of surface slope. 
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FIGURE 5.30 Slope magnitude map of nanotopography of polished 200 mm wafer. 


5.4.1.1 SQM™ (Surface Quality Monitor), from ADE, USA 


A. Laser beam reflectance-based bright channel technology (quad cell) 


1. 


AWIS-SOM: It is viewed as a next-generation multifunctional 
inspection tool; can be used in a 200 mm or 300 mm wafer; has 
multifunctional inspection capability; and is mainly used in 
measurement of particle, haze, and nanotopography. 


CR83-SQM: Currently used as the standard wafer substrate 
inspection tool worldwide. The WIS CR83-SQM provides wafer 
manufacturers the required nanotopography measurement 
capability for design rules down to 0.18 micron and better. It is 
the production tool for nanotopography inspection, sorting and 
surface mapping. Utilizing ADE's Quad Cell technology, the 
WIS CR83 provides height maps based on slope measurements 
and has a dynamic range from 0.5 mm to 10 mm. Its versatile 
software allows users to define four unique nanotopography 
bins for automated sorting by specifying a designated height 
change (nm) over a specified distance (spatial wavelength). It 
also allows users to view height maps at five selectable view 
scales. With these capabilities the WIS CR83 enables wafer 
manufacturers to simultaneously inspect for nanotopography 
features and monitor their production processes. 


B. SOMTM Technology: Bright Field Quad Cell 


e Surface nanometer scale topography height variation and 


location over whole wafer surface 


e Automated sort on user-defined parameters 
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Change in reflected angle = 2 x Change in surface slope 
Change in spot location = Focal length x Reflected angel 
Quad cell circuit converts spot location to voltage 

Extract surface slope distribution within a wafer 

Height map is calculated from the integration of the slope 


5.4.1.2 NanoMapper, from ADE Phase Shift, USA 


A. NanoMapper®: NanoMapper is a nanoscale science and engineer- 
ing (NSE) knowledge mapping system developed by the Artificial 
Intelligence Lab at the University of Arizona. NanoMapper 
enables users to search for patents (1976/1978-2006) or grants 
(1991-2006) by patent or grant number, keywords, and other data 
fields. NanoMapper also provides analysis tools. The National 
Science Foundation supported this research project. NanoMapper 
is an automated, precision surface mapping system available for 
research, analytical, and process control applications for 200 mm 
and 300 mm wafers. Using proprietary, optical interferometry 
from ADE Phase Shift, the system characterizes polished wafer 
surfaces by providing whole wafer topology measurements. 


1. Product highlights 
- Wafer nanotopography measurement 
— Measurement features: 
- Able to accurately repeat measurements 


— Measures 200 and/or 300 mm wafers, based on 
configuration 


- User editable recipes 

- Software features: 
— Measures nanotopography to the edge of the wafer 
- User editable recipes 


B. NanoMapper® FA (Figure 5.31): NanoMapper FA has all the 
analysis options of its R & D brother, but features full factory 
automation, including FOUP, SMIF, and open cassette capabili- 
ties. With NanoMapper FA, you can have: full edge-grip capa- 
bility, enhanced chuck performance, and additional SECS-GEM 
functionality. NanoMapper FA is an automated, precision surface 
mapping system available for research, analytical, and process 
control applications for 200 mm and 300 mm wafers. Using pro- 
prietary, optical interferometry from ADE Phase Shift, the system 
characterizes polished wafer surfaces by providing whole wafer 
topology measurements. 
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FIGURE 5.31 NanoMapper FA. 


1. Product highlights 


- Automated wafer nanotopography measurement 
— Measurement features: 

- Able to accurately repeat measurements 

- Subnanometer height resolution 
— Software features: 


- Measure nanotography to the edge of the wafer 


— Single-statistic pass/fail threshold curve analysis 
- User editable recipes 


C. Summary 
1. Unfiltered height data typically includes ~10 microns of topog- 
raphy for quality prime wafers. 


2. Spatial filtering removes the high amplitude long wavelength 
shape information to reveal nanotopography. 


— Spatial filter cutoff wavelength should be approximately 
twice the CMP length. 


- Nanotopography directly impacts post-CMP film thick 
ness variation. 


Measuring surface height directly, NanoMapper provides the subnano- 
meter sensitivity necessary to address the process development needs 
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for leading edge semiconductor device design rules down to 0.1 micron. 
Interactive 3-D graphics and powerful analysis software allow rapid visu- 
alization and quantification of polishing process effects. These effects 
include nanotopology defects that ultimately limit wafer usability dur- 
ing semiconductor device fabrication. The result is faster process devel- 
opment and precision process control during production, with reduced 
wafer scrap costs. 


5.4.1.3 DynaSearch, from Raytex, Japan 


DynaSearch (Figure 5.32) is an optical measurement system for evaluating 
the flatness of 200 mm and 300 mm wafers based on a unique, proprietary 
image processing algorithm. It views the entire surface of the wafer and 
gives real-time image data to derive the wafer topography from the angu- 
lar components of the skewed surface. The system is capable of evaluating 
both the wafer flatness and nanotopography (Figure 5.33). 

Using Raytex’s proprietary optical method, the DynaSearch performs 
scientific wafer topography measurement and inspection to quantify 
wafer flatness and topography. This one unit handles both the wafer 
flatness and the nanotopography measurements necessary to evaluate 
wafer-manufacturing quality. The Raytex’s EdgeScan edge inspection, 
BackScan backside inspection, and flatness and topography measurement 
performed by the Raytex DynaSearch are a winning product combination 
that enables users to create a total shipping inspection line. 


1. Features: 


e Performs high-resolution high-accuracy wafer topography 
measurement using Raytex’s proprietary optical measure- 
ment system 


FIGURE 5.32 (a) DynaSearch N4-800, (b) DynaSearch N4-1200. 
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FIGURE 5.33 (a) Flatness measurement, (b) nanotopography measurement. 


e Minimal impact due to vibration or other equipment 
environmental conditions unavoidable using conventional 
interferometry 


e Employs a stepper-type chuck for measurement to reproduce 
lithography flatness conditions in measurement 


Nanotopography 143 


2. Measurement items: 
e Flatness measurement 
e Nanotopography measurement 
3. Applications: 
e Outgoing/acceptance inspection of mirror and epitaxial wafers 
e Wafer shape evaluation before and after film deposition and 
epitaxial processes 
e Yield improvement of photolithographic process 
e Conditioning of grinding, polishing, diffusion, and epitax- 
ial growth 


5.4.1.4 Line Profile Comparison among Three Instruments 


The line profiles of height change along the x-axis of the wafer mea- 
sured by SQM, NanoMapper, and DynaSearch are superimposed on 
Figures 5.34a-e. Each of five measurements is done for different wafers. 
The filters applied to raw profiles are “Standard Filter” for SQM and 
“Gaussian Filter” for NanoMapper and DynaSearch. The cutoff length is 
set to 20 mm for all methods. From the overview of the profiles, the posi- 
tions of local peak and valley almost coincide for the three tools. However, 
SQM gives somewhat different profiles from the other two methods. 


5.4.1.5 Calibration among the Standard Deviations of Height 
Change Measured by Three Kinds of Instruments 


Figures 5.35a-c is the calibration among the standard deviations of height 
change profiles. As for the standard deviation, SQM gave smaller values 
and variations than NanoMapper or DynaSearch (see also Table 5.1). 


144 


Nanoparticle Engineering for Chemical- Mechanical Planarization 


FIGURE 5.34 Sensitivity comparisons of SQM, NanoMapper, and DynaSearch: (a) 
Measurement #1, (b) Measurement #2, (c) Measurement #3, (d) Measurement #4, (e) 
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FIGURE 5.35 Standard deviation comparison of SOM, NanoMapper, and DynaSearch: (a) 
SOM and NanoMapper, (b) NanoMapper and DynaSearch, and (c) DynaSearch and SQM. 
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STDEV Correlation: DynaSearch vs SQM 
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Novel CMP for Next-Generation Devices 


6.1 The Progress of Semiconductor 
Devices upon Current Demand 


Chapters 2 through 5 describe CMP process technology for semiconduc- 
tor device makers. This chapter will describe CMP planarization technol- 
ogy for emerging devices and materials. From the point of view of the 
researcher whose specialty is detailed (or accurate) processing, the advent 
of CMP technology has diverse meaning. Silicon wafers on devices are 
an unapproachable part to the mechanic processing technician. Because 
of complex factors, the structure of a devices circuit is becoming highly 
integrated and super miniature. As a result, the degree of rugged process- 
ing surface falls under nano scale because device processing and machine 
processing have very different materials of processing. Devices are shrink- 
ing to under 60 nm; therefore, new material and structure processing are 
appearing. To adapt to this condition, more progressive CMP processing 
is required and new structures and materials will arrive successively only 
through CMP processing. 

The memory industry is led by metal-oxide semiconductor (MOS) 
memory, which is largely divided into volatile and non-volatile memory 
(Figure 6.1). The example of volatile memory is dynamic random access 
memory (DRAM), whereas non-volatile memory is representative of flash 
memory. DRAM and flash memory developed rapidly due to the demands 
for high speed and high capacity devices such as computers, digital cam- 
eras, mobile phones, and MP3s (Figure 6.2). New structures and materials 
were used for improving the performance of these devices. CMP process- 
ing has faced a new challenge as well. 

Even though the integration of DRAM and flash memory is increasing 
during this brilliant growth, it is expected that it will hits its limits. To 
overcome its limit, instead of MOS memory, the latest non-volatile memory 
research includes phase-change random access memory (PRAM), nano- 
floating gate memory (NFGM), polymer random access memory (PoRAM), 
and resistance random access memory (ReRAM). But similar to MOS mem- 
ory, these new non-volatile memory systems cannot increase integration and 
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form multilayer without planarization processing. This chapter introduces 
DRAM under 60 nm of MOS memory, CMP processing applied to NAND 
flash, and CMP processing for next-generation memory producing. 


MOS Memories 
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Random Access 
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FIGURE 6.1 Classification of MOS memories. 


Year of Production 2005 2006 2007 2008 2009 2010 2011 2012 2013 

DRAM % Pitch (nm) (contacted) 80 70 65 57 50 45 40 36 32 

|MPU/ASIC Metal 1 (MI) % Pitch m == a 33 i ae 40 y ES 

(nm) 

[MPU Physical Gate Length (nm) 32 28 25 23 20 18 16 14 13 

DRAM Product Table 

Cell area factor [a] 8 8 8 6 6 6 6 6 6 
IS |Cell area [Ca = af2] (um?) 0.051 0.041 0.032 0.019 0.015 0.012 | 0.0096 | 0.0077 | 0.0061 


Cell array area at production (% of 


ee. 63.00% | 63.00% | 63.00% | 56.08% | 56.08% | 56.08% | 56.08% 
chip size) § 


56.08% 56.08% 


Generation at produciton § 1G 2G 2G 2G 4G 4G 4G 8G 8G 
Functions per chip (Gbits) 1.07 215 2.15 2.15 429 4.29 4.29 8.59 8.59 
Chip size at production (mm2)$ 88 139 110 74 117 93 74 117 93 
Gbits/em2 at production$ 122 154 194 291 3.66 4.62 5.82 7.33 9.23 
Flash Product Table 

Elash 16 Pitel (min) (Unscontacted 75.7 63.6 56.7 50.5 45.0 40.1 35.7 318 28.3 


Poly)(f) 
Cell area factor [a] 


IS [Cell area [Ca = af] (um?) 
Cell array area at production (% of 


67.5% 67.5% 67.5% 67.5% 67.5% 67.5% 67.5% 67.5% 67.5% 


chip size) $ 

Generation at production $ SLC 4G 4G 4G 8G 8G 8G 16G 16G 16G 
Generation at production § MLC 8G 8G 8G 16G 16G 16G 32G 32G 32G 
Functions per chip (Gbits) SLC 4.29 4.29 4.29 8.59 8.59 8.59 17.18 17.18 17.18 
Functions per chip (Gbits) MLC 8.59 8.59 8.59 17.18 17.18 1718 | 3436 | 3436 34.36 
Chip size at production (mm2)$SLC| — 144 101.8 80.8 128.3 101.8 80.8 1283 101.8 80.8 

ay ; 2 

ae size at production (mm“)§ 144 101.8 80.8 128.3 101.8 80.8 128.3 101.8 80.8 
Bits/cm2 at production $ SLC 3.0E+09 | 4.2E+09 | 5.3E+09 | 6.7E+09 | 8.4E+09 | 1.1E+10 | 13E+10 | 1.7E+10 | 2.1E+10 
Bits/cm2 at production $ MLC 6.0E+09 | 8.4E+09 | 1.1E+10 | 13E+10 | 17E+10 | 2.1E+10 | 2:7E+10 | 3.4E+10 | 43E+10 


FIGURE 6.2 Technical roadmap for DRAM and NAND flash memory. 
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6.2 Complementary Metal-Oxide 
Semiconductor (CMOS) Memory 


Memory semiconductors should be of high capacity through high-inte- 
grated circuits. DRAM cell factor evolved from 8F? into 6F?. Along with 
expectations of 45 nm in 2010 to 32 nm in 2013, there is an anticipation that 
capacity per chip will increase fourfold like 2.15 Gbit to 8.59 Gbit. Flash 
memory keeps the design rule of 4F?and when future technology passes 
through 56.7 nm to 28.3 nm, the capacity per chip will increase rapidly 
from 8 G to 32 G. 

Fulfilling multilayer and miniature structures of memory devices led 
to the introduction of new materials and structures. For the structure, the 
design rule decreases less than 70 nm and the short channel effect (SCE) 
phenomenon appears to have a bad influence on the device drive if exist- 
ing planar transistor (TR) is applied. To solve this problem, studies are 
in progress to apply recessed channel array TR and three-dimensional 
structured FinFET in DRAM and floating gate, twin SONOS, and FinFET 
SONOS in flash memory (Figure 6.3). 

New materials are applied to maximize the capacity of device. To 
increase the capacitance of cap used in DRAM, studies about high-k 
dielectric material are in process. Flash memory uses a gate material with 
polysilicon by reason of high speed and stable storage. To reduce semicon- 
ductor device RC delay, Cu metal lines and low-k are being introduced. 
This section represents the concept of CMP processing being introduced 
to DRAM and NAND flash devices. 
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FIGURE 6.3 Roadmap for transistor structures of DRAM and NANA flash memory. 
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A RCAT structure of three-dimensional recess channel array transis- 
tor was developed for the device memory to solve the problem of short 
channel effect, a demand of integration. The surface of device proces- 
sion reduces and channel makes recess to lengthen the valid length. As 
a result, short channel effect can be reduced. Future technology under 
60 nm will anticipate the partly insulation FET (PiFET) structure to the 
part of channel. PiFET that are drawn to the concept of body-tied SOI 
limits the junction depth by dielectric layer to easily form ultra shallow 
junction. It also has a small amount of leakage current and it has a merit 
of SOI that short channel effect is almost nonexistent. This is a device to 
diminish the production cost. DRAM of future technology under 45 nm 
is anticipated to have a MBCFET device structure of metal-oxide semi- 
conductor field-effect transistor (MOSFET), which has multichannels by 
passing through FinFET. This device makes driving current larger and 
has a great advantage of current resistance capability of a gate because it 
has a gate all-around (GAA) structure. The self-aligned STI processing, 
which is a method of formation of floating gate into a device less than 
60 nm, enlarges the surface of the floating gate. Therefore, it can increase 
coupling ratio. To avoid the coupling phenomenon, floating gates to the 
active area are aligned correctly; that is self-aligned poly Si floating gates 
fabricated by the CMP process are necessary to overcome the misalign- 
ment between the active area and the floating gate in cell arrays. 


6.2.1 Noble Metal CMP for DRAM 


Existing SiO, used in gate dielectric faces its limit because thickness of 
thin film becomes thinner by the integration of the semiconductor device. 
Indeed, power dissipation exceeds the standard rate because of the leakage 
current by tunneling of carrier by electric field is increased. Consequently, 
it has the same EOT electronically. High-k dielectric makes possible the 
embodiment of thickness of thin film without tunneling, is physically 
interested. 

With using high-k dielectric film, simplifying the cell structure and for- 
mation process is the most efficient method to ensure sufficient capaci- 
tance for the high-integrated capacitor's role in narrow surfaces like 
miniaturizing the next generation of DRAM. Previously used dielectric 
film in low-k materials are NO of SizN,¿/SiO, and ONO of SiO,/Si¿N¿/SiOX. 
For the next-generation capacitor, the high-k materials mainly used are 
ALO, HfO,, ZrO,, Ta,O;, BST, and STO, as well as other materials such as 
HfSiOX and ZrSiOX. In semiconductor industry, high-k thin films form 
film by chemical vapor deposition (CVD). In relation to this, the study and 
developments of CVD precursors are as follows. 

Capacitor using HfO, base dielectric film, applies TiN by top/bottom 
electrode. However, capacitor technology under 50 nm needs to develop 
new dielectric material and electrode material. So, noble metals, such as 
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Ru, Pt, and Ir, are being researched as new electrode materials. In the case 
of the noble metal, which is a stable material, it is not easy to form a capaci- 
tor by using etch back or CMP processing. Noble metal CMP processing, 
including Ru, should oxidize the surface of polishing target film for the 
polishing process like other metal CMP. 

In Ru CMP, polishing stops when oxide, dielectric between electrodes, 
exposures while Ru is polished. The removal rate of Ru is under the con- 
trol of an oxidizer added into slurry. The Pourbaix diagram of Ru shows 
that RuO, and Ru,O, exist when Ru oxidizes, and the removal rate of noble 
metal CMP depends on the degree of oxidization (Figure 6.4). However, in 
the real situation, there is no oxidizer that can secure the safety of slurry 
and strongly oxidize noble metal at the same time. Moreover, for success- 
ful Ru CMP, polishing selectivity of Ru:oxide should be considered with 
STI CMP processing. 

In this difficult situation, the reason for applying noble metal CMP is 
closely related to the electronic characteristics of a device. For example, Ru 
CMP makes it possible to produce a capacitor to have a higher capacitance 
than with dry etching because it loses less than dry etching and it has a 
clear pattern formation as in Figure 6.5. That is, the electronic character- 
istic can be improved when Ru CMP applies to a device process. In fact, 
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(H,RuO;) 


Weak Passivati 
(RuO,H,O) 


Corrosion Potential (mV) 
Ru R/R Increase 


0 2 4 6 8 10 12 14 
pH 
FIGURE 6.4 Electrochemical characterization of various oxidizers in Pourbaix diagram of 
Ru, obtained from Tafel plots and pH of the slurries. 


FIGURE 6.5 SEM images after (a) Ru CMP and (b) Ru dry etching process. 
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FIGURE 6.6 Comparison of leakage current of RIR capacitors node separated by Ru CMP 
and Ru dry etching. 
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FIGURE 6.7 Positive and negative leakage current of RIR capacitor at 1V and —1V. 


when processing by dry etching, the leakage current is 6.537 fA at 1V. As 
shown in Figure 6.6 and Figure 6.7, the value is remarkably low when a 
device is processed by CMP. In addition, cell capacitance is 13.4 fF/cell; the 
CMP process makes it possible to get a much higher value than dry etch- 
ing, which is 8.4 fF/cell. 


6.2.2 Poly Si CMP for NAND Flash Memory 


8 G NAND flash memory connects 32 cells in a sequence without source 
and drain contact each cell. Two transistors of SSL (source select line) and 
GSL (ground select line) are connected in a series between bit-line con- 
tact and CSL (common source line). Floating gate and control gate exist in 
each cell and is formed channel by controlling the voltage of control gate 
(Figure 6.8a). Electron that transfers this channel becomes tunneling to be 
accumulated at the control gate to be a role of memory. The state of stored 
electron at the floating gate through F-N tunneling is the program state. 


Novel CMP for Next-Generation Devices 155 


Active | 
Silicon | 


| 
| 
| 


Active 
Silicon 


(b) 
FIGURE 6.8 (a) Cross-sectional SEM image of a 90-nm NAND-flash memory-cell with- 
out CMP. (b) Cross-sectional TEM image of a sub-60 nm structure with interpoly ONO 
dielectrics. 


Missing electron state is the erase. Programming and erasing are formed 
according to the size of Va of cell transistor. 

So, the Vof a device rises at programming time. The state of program/ 
erase can be confirmed through the difference of Vp. The augmentation 
of integration is easily achieved because the simplicity of the structure of 
NAND flash memory and scaling down are easy. Recently, the emergence 
of multilevel cell (MLC) makes to have a higher integration. This reduction 
of design rule might make it possible to be at least 40 nm. Because of the 
diminution of charge loss tolerance based on scaling down by word-line 
applied voltage with the decreasing of coupling ratio, interference of float- 
ing gate becomes a severe problem. Coupling ratio suddenly decreases 
at the 60 nm level. Formation processing of the existing floating gate and 
self-aligned CMP process, like Figure 6.8(b), are brought in because there 
is no space to increase the size of floating gate that can enlarge the cou- 
pling ratio. 
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With a method of forming floating gate into a 60-nm level device, when 
polysilicon CMP is used in connection with self-aligned STI process, the 
surface of floating gate can be extended like in Figure 6.8b. This leads 
to an increased coupling ratio. Polysilicon CMP is an absolutely neces- 
sary process for improving the capacity of flash memory. When slurry, 
which has high removal selectivity to polysilicon and oxide, makes prog- 
ress CMP processing, polysilicon recess is occurred by dishing, which is 
CMP characteristics. Or electrical characteristics inside cells, including 
Vu, can become worse by thickness distribution of floating gate, which 
occurs by CMP non-uniformity. The voltage of floating gate is capacitance 
based on ONO (oxide—nitride—oxide) between control and floating gate. 
Crotl= Cono + Crunnet) 18 capacitance based on Tox and capacitance based on 
ONO is decided according to the voltage of control gate. 

To obtain a high coupling ratio, the value of ONO capacitance should 
be increased. Capacitance needs to maximize the selective surface above 
floating gate. But the width of floating gate, which is a factor for deciding 
a selective surface, is determined when the design rule was set. The height 
of the floating gate will be a key factor to determine the coupling ratio in 
the end. 

Furthermore, stored electron at the floating gate rapidly decreases in 
accordance with the decreasing of design rules. In case of SLC (single-level 
cell), electron loss less than 20% is allowed within the same range. In case 
of MLC, each program state has to have the same Vp range. To reduce the 
tolerance limit of electron loss per each state, control gate and capacitance 
of floating gate should be increased; however, scaling down of interpoly 
ONO endsits limit. Another problem is that the gap between cells becomes 
narrow according to the increasing of cell integration, and an interference 
phenomenon will arrive by capacitive coupling between floating gates. The 
transition of cell V,, by interference of floating gate brings up the V,, transi- 
tion of around cell. So, when V, changed in the phase of programmed one 
cell, Vy is changed due to the coupling phenomenon in which electrode is 
accumulated in floating gate by writing to the next cell. This change of Vu 
becomes a severe problem at the MLC action, which should keep the Va 
gap between cells. Voltage of floating gate is influenced by control gate of 
around cell and floating gate, and is not only influenced by the voltage of 
control gate of the corresponding cell. 

As described earlier, by increasing integration, coupling ratio of floating 
gate, permitted possible charge loss, and V,, shift by capacitive coupling 
of intercells become problems. Especially, due to narrowing the space 
between bit line and word line, V,, shift of cell dramatically increases 
around 0.2 V to make higher Vp distribution through interference of float- 
ing gate between cells. To decrease this interference, the space between 
cells fills with low-k material or reduces the height of floating gate. At 
the same time, exact thickness control of floating gate should be needed. 
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Especially, the height of floating gate should be controlled through the 
CMP process because that greatly influences word-line and bit-line cells. 

To achieve the isolated poly floating gate, the polishing should be 
stopped at the oxide film. Therefore, the poly-to-oxide removal selectivity 
is the most important factor for the poly isolation CMP process. Poly isola- 
tion CMP without selectivity induces dishing and rugged topography on 
surfaces, which result in deterioration in the quality of interlayers in the 
device. 

The two steps of poly Si CMP are shown in Figure 6.9. The first step is 
to remove Si, then polishing at oxidize film is to be stopped in the sec- 
ond step. The polishing of Si needs an addition for accelaration polishing 
because it should be progressed over 2000 Á per minute. This additive is 
mainly a chemical compound that has an amine group and abrasives pol- 
ish the Si film by the complex formation of Si surface and amine group. 
For the last step, lower the polishing rate of oxidized film to less than 100 
A/min for stopping polishing. 

The difference of hydrophobicity between poly Si and oxide film is the 
key factor to achieving the high removal selectivity in the poly isolation 
CMP process. As oxide film is more hydrophilic than silicon film, hydro- 
philic polymer is preferred to be adsorbed on the hydrophilic oxide film. 
By utilizing selective adsorption of hydrophilic polymer, the passivation 
layer formed on oxide film can prevent the direct contact of abrasive par- 
ticles, which results in suppression of the removal rate of oxide film dur- 
ing poly isolation CMP process (Figure 6.10). 


1 Oxide Deposition 4 Amorphous Si Deposition 
in-situ PHg doping, dose:1.5 x 1010 eal? 
Oxid : T 
e 0.3 um Amorphous-Si Deposition ; 
Nitride ; 150-200 nm ii 
Pad Oxide e! ees Gap Fill = 5nm 
Oxide 


10-1 Gap Fill 
0.5 um Oxide 


Si Substrate Si Substrate 


2 STI CMP 5 Poly Isolation CMP 


Nitride 7 |_ 150-200 nm - sei 
o — ar E Gap Fill L 


Oxide Oxide 


Si Substrate Si Substrate 


3 Nitride Strip with Phosphoric Acid 6 Forming of Poly Si by Annealing 
Il 910°C RTA: 15secN3 


f Nitride Strip 
Gap Fill bid Onide Gap Fill 
Oxide Oxide 


j| Floating Gate 


85 nm 


Si Substrate Si Substrate 


FIGURE 6.9 (See color insert) Schematic process flow of the poly isolation CMP process. 
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FIGURE 6.10 (See color insert) Mechanism for the poly isolation CMP process. 
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FIGURE 6.11 XPS spectrum of poly Si and oxide film at pH 10. 
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To investigate the surface structures of poly Si and the plasma-enhanced 
tetraethylorthosilicate (PETEOS) films, Si 2p XPS analysis for each film 
was conducted as shown in Figure 6.11. The Si2p spectra of poly Si film 
in Figure 6.1la shows two symmetrical profiles, of which binding ener- 
gies correspond to 98.7 eV for Si and 102.5 eV for SiOH. Compared with 
the intensity of Si peak, that of SiOH is considerably low, which mean the 
surface structure of poly Si film consists of Si structure for the most part. 
On the other hand, as shown in Figure 6.11b, the spectrum observed from 
the PETEOS film was mainly separated into two symmetrical profiles. 
The binding energies observed from this result are 102.50 and 103.30 eV, 
which correspond to SiOH and SiO, respectively. The intensities of the two 
peaks are almost similar and this observation indicates that a respectable 
amount of siloxane group is present on the surface of PETEOS film. 

The difference originated from hydrophilic and hydrophobic character- 
istics is confirmed from contact angles of deionized (DI) water on each 
surface. The contact angle of DI water on poly Si film was 61.2” in the 
Kruss contact angle measurement system. On the other hand, DI water 
drop exhibited a low contact angle below 10%, thus spread out on the oxide 
wafer as shown in Figure 6.12. This result is coincided with the XPS result; 
that is, the surface of SiO, is more hydrophilic than that of poly Si due to 
siloxane (=Si-O-Si=) bonding. 

Figure 6.13 shows the adsorption isotherms for PAM on poly Si and SiO, 
as a function of PAM concentration. Adsorption of PAM on oxide sur- 
faces increases and reaches a plateau level of approximately 0.23 mg/m?. 
However, PAM is scarcely adsorbed on poly Si surfaces. This is driven by 
the difference in hydrophobicity, which affects the interaction between 
PAM and each surface. At high pH, the negative site MO- of metal oxide 
(MO) surface bonds with the weakly acidic NH, function. Therefore, the 
interaction between SiO- of SiO, surface and NH, group of PAM led to the 
selective adsorption of PAM on SiO,. 
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FIGURE 6.12 Contact angle of poly Si and oxide film at pH 10. 
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FIGURE 6.13 Adsorption isotherms of PAM on poly Si and oxide at pH 10. 


Adhesive forces were measured to analyze the interaction between Si 
atomic force microscope (AFM) tips and the deposited films from the 
force-distance measurement with the AFM. The force-distance curve is 
expected to elucidate the adsorption behavior of PAM on the deposited 
films. In the force-distance measurement, there is no interaction until the 
tip is close enough to be attracted to the surface. As the tip approaches 
to the surface, it contacts with the surface. After the contact, the tip is 
retracted from the surface, the cantilever is bent, and a repulsive force 
(positive) is measured. When the tip is being retracted, an attractive force 
is measured (negative). When the critical force is reached, the tip is sepa- 
rate from the surface and this point is called the pull-off point. Therefore, 
the pull-off point, which corresponds to the point of the critical force, is 
determined by the degree of the adhesive force between the tip and the 
surface. The higher the adhesive force between the tips and the films, the 
lower the pull-off point. Figure 6.14a shows the force-distance curves of 
the tips and poly Si film at pH 10 as a function of the PAM concentra- 
tion. In the absence of the absorbed PAM molecule, an adhesive force was 
observed at approximately 20 nm of separation distance. It is of interest 
that there is no significant difference between the surface forces of the tip 
and poly Si film even with the presence of PAM. This result is almost the 
same for all samples, irrespective of the concentration, which means that 
PAM is scarcely adsorbed on poly Si film. 

On the other hand, Figure 6.14b shows the force-distance curves of 
the tips and the oxide film at pH 10 as a function of the PAM concentra- 
tion. In the absence of the absorbed PAM molecule, an adhesive force was 
observed at approximately 30 nm of separation distance. However, it was 
found that the adhesive force disappeared by the addition of PAM due 
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FIGURE 6.14 Force-distance curve between the AFM tip and the surface of wafer: (a) poly 
Si, (b) oxide. 


to the interaction between the siloxane bonding of the oxide film and the 
amine group of PAM at pH 10. This result is more clearly observed as the 
concentration of PAM increases. Therefore, it is expected that the presence 
of PAM causes the suppressed removal rate of the PETEOS film by the 
formation of the passivation layer on the oxide film and the increase in the 
concentration of PAM leads to the decrease in the oxide removal rate. 
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FIGURE 6.15 Removal rate of poly Si and oxide film. 


The results of the CMP performance as a function of the concentration 
of PAM are shown in Figure 6.15. As the concentration of PAM in poly 
isolation CMP slurry increased, the slurry for poly isolation CMP process 
had a higher removal rate for the PETEOS film, reduced from 82 A/min 
without PAM to 12 A/min with 0.06 wt% of PAM. It is considered that 
this decrease in the removal rate of the PETEOS film as a function of the 
concentration of the PETEOS film is due to the formation of the polymer- 
coated layer on the PETEOS film caused by the high affinity of PAM to 
the PETEOS film. From the adsorption isotherms shown in Figure 6.13, 
the PETEOS film has high affinity to PAM, which induces PAM to adsorb 
on the surface of the PETEOS film. The PAM layer formed on the PETEOS 
layer reduces the possibility of the penetration of the abrasive particles 
to the PETEOS film and also decrease the friction force between abrasive 
particles and the PETEOS film. On the other hand, the removal rate for 
poly Si film remained at 2190 A/min without PAM and 2178 A/min with 
0.06 wt% of PAM. Compared with the decrease of the removal rate for 
the PETEOS film, the change of the removal rate for poly Si film is small 
enough to ignore. It is expected that PAM scarcely adsorbs on poly Si film 
due to the low affinity between them as shown in Figure 6.13. Therefore, 
the PAM layer on poly Si is not thick enough to prevent the penetration 
of the abrasive particles to poly Si film even though the concentration of 
PAM is increased. Therefore, it is expected that poly Si film is directly 
exposed to the mechanical polishing during the poly isolation CMP pro- 
cess and it results in the small change of the removal rate of poly Si film 
with addition of PAM. As a result, the difference in the removal mecha- 
nism of each film achieves the drastic improvement in poly-Si-to-oxide 
selectivity values from 26.7 to 181.8 as the concentration of PAM increases 
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from 0 to 0.06 wt%. Consequently, the formation of a polymer-coated layer 
by the selective adsorption of PAM on the PETEOS film plays a dominant 
role to improve poly-Si-to-oxide selectivity in poly isolation CMP. One can 
conclude that the control of selective adsorption of polymer regarding 
the characteristics of target films is a key technique to achieve poly-Si-to- 
oxide selectivity in the poly isolation CMP process. 


6.3 Novel CMP for New Memory 


Recently, flash memory—used in mobile phones, MP3s, digital cameras, 
and USBs—is a non-volatile memory device that solved a weak point of 
DRAM device of volatile movement. NAND flash memory in business 
today is possible up to 2 gigabyte in integrated diagram. It has a charac- 
teristic of action at high supply voltage of 10 to 15V. 

Flash memory device applies threshold voltage transition of transistor 
with memory motion theory to floating gate made by polysilicon through 
the accumulating or discharge of electric charge. With the interaction of 
flash memory device structures, unbalanced polysilicon grain-size dis- 
tribution increases threshold voltage distribution. The movement above 
10 V voltage generated from devices is increasing. The technical limit of 
integration is known as 16 gigabit level. Therefore, there is a need for the 
next-generation non-volatile memory devices that are several integrations 
over 64 gigabit with tens of nanosecond write/erase time (Figure 6.16). 
Research has progressed in the areas of phase-change random access 
memory (PRAM), nano-floating gate memory (NFGM), polymer random 
access memory (PoRAM), and so on (Figure 6.17). 


6.3.1 GST CMP for PRAM 


Phase-change random access memory (PRAM) has been intensively 
studied as one of the candidates of a non-volatile memory to challenge 
conventional memories such as DRAM and flash memory, due to its fast 
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FIGURE 6.16 (See color insert) Comparison of new memory and conventional memory. 
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FIGURE 6.17 Classification of memory. 


switching speed, good endurance, and compatibility with CMOS logic 
process. PRAM stores data according to the resistance change of chal- 
cogenide (GeSble, GST) materials that is basically caused by the phase 
change of chalcogenide materials, and the phase change is followed by heat 
flux from the bottom electrode (BE) to contact dimension. Chalcogenide 
materials exhibit at least two states. The states are the amorphous and 
crystalline states, and transitions between these states may be selec- 
tively initiated. The amorphous state generally exhibits higher resistivity 
than the crystalline state. The phase change may be induced reversibly. 
Therefore, the memory may change from the amorphous to the crystalline 
state and may revert back to the amorphous state thereafter in response 
to temperature changes. In effect, each memory cell may be thought of as 
a programmable resistor, which reversibly changes between higher and 
lower resistance states. The phase change may be induced by resistive 
heating. As shown in Figure 6.18, PRAM manufacturing applies the CMP 
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FIGURE 6.18 Schematic drawing of GST CMP process for on-axis confined structure in 
PRAM device. 
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technique to planarize the GST layers deposited on the front surfaces of 
wafers. In particular, to scale down PRAM beyond 256 Mbit (design rule 
less than 100 nm), achieving the writing current level of several hundred 
uA is an essential parameter. Therefore, GST was on-axis confined within 
a small pore, which may result in high planarity of GST surface. The SiO, 
film acts to isolate the adjacent cell with GST materials, and as a barrier 
that stops the polishing process immediately after complete removal of 
the GST materials. Besides, in the fabrication of PRAM beyond 256 Mb, the 
ring-shaped contact structure for small BEC formation is very important 
to process technologies. The writing current flows through the perime- 
ter of the ring-shaped contact instead of the whole body of the contact. 
Obviously, the effective contact area of the ring-shaped contact has less 
dependency on the contact diameter because it is linearly proportional to 
the diameter of the defined contact. Thus the ring-shaped contact struc- 
ture has robustness against the contact size variations. The CMP slurry 
needs to perform a high selectivity (>50:1) of polishing rate between GST 
and SiO, films. CMP slurry is composed of colloidal silica abrasive, sur- 
factant, organic chemical, alkaline agent, titrant, and deionized water to 
control the removal selectivity of GST-to-SiO, films and TiN-to-SiO,. 

For CMP evaluation, 8-inch silicon wafers with a multilevel structure 
of NGST/SiO,/Si were used. The as-deposited NGST film had an amor- 
phous structure. The NGST film was deposited using a metal organic 
precursor under a nitrogen atmosphere at 350*C, giving a composition of 
approximately 25:23:52 (Ge:Sb:Te) by atomic percentage, as shown by the 
cross-sectional TEM energy-dispersive x-ray spectroscopy (EDS) analysis 
results listed in Figure 6.19. The bottom oxide film was deposited by the 
PETEOS method at 720*C. The thicknesses of the as-deposited NGST and 
oxide films were 200 and 100 nm, respectively. The films were polished 
on a CMP system (6EC, Strasbaugh, USA.) with a single polishing head 
and a polishing platen. We used an industry-standard CMP polishing pad 
(IC1000/Suba IV, Rohm and Haas Electronic Materials, USA.). The thick- 
ness variation of the NGST films on the wafers before and after CMP was 
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FIGURE 6.19 TEM energy-dispersive x-ray spectroscopy (EDS) analysis results. 
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measured with a spectroscopic phase-modulated ellipsometer (UVISEL, 
HORIBA Jobin Yvon, Japan). The oxide film thickness before and after 
CMP was estimated with a tabletop film analysis system (Nanospec 180, 
Nanometrics, USA). The contact angle was measured with a contact angle 
meter (DIGIDROP, GBX, France). The chemical bonding characteristics 
of the NGST film surface after dipping in the slurry were characterized 
by x-ray photoelectron spectroscopy (XPS) using an ESCA 2000 (V.G. 
Microtech, UK) system with a monochromatic Al Ka x-ray monochro- 
matic source (linewidths = 0.85 eV, energy = 1487 eV). 

Figure 6.20a shows the polishing rate of NGST film as a function of the 
tetramethylammonium hydroxide (TMAH) concentration. Without TMAH 
in the slurry, the polishing rate was 6 nm/min. With the TMAH addition, 
however, the NGST polishing rate drastically increased up to 242 nm/ 
min at a TMAH concentration of 0.12 wt%. This striking difference in the 
polishing rates with and without TMAH resulted from the chemical reac- 
tions between the NGST film and the TMAH, which we explain later in 
detail. Beyond a TMAH concentration of 0.12 wt%, however, the polishing 
rate of NGST slightly decreased. This behavior was related to the way that 
TMAH effectively influences chemical reactions at the NGST film surface, 
such as hydrophobic or hydrophilic interactions. The characteristics of the 
reaction of TMAH with the NGST film surface can be estimated from the 
surface tension by measuring the contact angle. In general, wettability can 
be quantitatively evaluated in terms of the spreading coefficient, which is 
the energy difference between the solid substrate and the contacting liq- 
uid phase. The interaction of the interfacial tensions at the liquid-vapor- 
solid junction is described by the Young equation as follows: 


cos) = T (6.1) 
LV 


where 9 is contact angle (0 > 0°), and rsv, Ysy, and ra are the effective inter- 
facial tensions. 

After dipping the NGST film in the slurry for 1 min at 45°C, the contact 
angle of the NGST film surface was measured as a function of the TMAH 
concentration, as shown in Figure 6.20b. The contact angle decreased dras- 
tically with increasing TMAH concentration up to 0.12 wt% and then grad- 
ually increased. The contact angle results shown in Figure 6.20b coincided 
with the inverse of the NGST film polishing rate curve in Figure 6.20a. The 
hydrophilicity indicated by a small contact angle means that adhesion 
of TMAH molecules on the NGST film surface, more so than cohesion, 
plays a dominant role in enhancing the impact probability of molecules 
for the chemical reaction to etch the oxidized surface of the NGST film. 
The enhanced impact probability results in an accelerating chemical and 
mechanical reaction speed between the colloidal silica abrasives and 


Novel CMP for Next-Generation Devices 167 


“4UUU iU 
3500 
E 3000 18 
sE 
© = 2500 e — 
D — a 6 = 
£ £2000} a £ 
x = — a E o 
@ | 1500} la £ 
ou 
a. © 1000+ o 
© e 2 12 
500 | ee, 
ol @ 
fi A 1 A 1 A 1 A 1 A 1 0 
0 0.06 0.12 O18 03 0.5 
TMAH wt% 


(a) 


(b) 


FIGURE 6.20 (a) Polishing rate of NGST film as a function of the TMAH concentration, (b) 
contact angle of the NGST film surface. 


the NGST film surface during CMP. As a result, a slurry with enhanced 
hydrophilicity initially produces a higher polishing rate. Note that TMAH 
molecules initially etch off the oxidized surface of the as-deposited NGST 
film, and the exposed, unoxidized NGST surface is then chemically oxi- 
dized, resulting in a hydrophilic surface. The hydrophilicity of the NGST 
film surface, however, becomes progressively lower once the TMAH con- 
centration exceeds a specific value (0~0.12 wt% in this experiment). This 
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is probably associated with the formation of a polymer layer enhanced by 
excessive TMAH molecules on the NGST film surface, making the surface 
less hydrophilic and suppressing the direct mechanical contact between 
the colloidal silica abrasives and the unoxidized NGST film surface. As a 
result, the polishing rate of NGST film decreases with increasing contact 
angle. These results for NGST film CMP—namely, the relation between 
the polishing rate and contact angle—are the same as those that we previ- 
ously reported for polysilicon CMP. Therefore, this study has provided 
key information on using TMAH in colloidal silica slurry to obtain a high 
polishing rate of NGST film during CMP. 

We conducted an additional dipping test to confirm the validity of the 
contact-angle mechanism of NGST film CMP. The as-deposited NGST 
film was dipped into the slurry for one minute at 45*C. After dipping, the 
chemical composition of the NGST film surface was characterized by XPS, 
which is one of the best ways to examine the chemical binding character- 
istics of film surfaces and further explore the chemical reaction between 
TMAH and the NGST surface. Figure 6.21a shows the XPS spectra for Ge 
2p. Before dipping, there was a strong GeO, peak at 1219.8 eV; after dip- 
ping, this peak was weakened. Thus, TMAH is a strong etchant of GeO, 
on the NGST film surface. Figure 6.21b shows the XPS spectra for Sb 3d. 
The Sb 3d*? peaks for Sb,O; and metallic Sb bonding occurred at 539.8 eV 
and 537-538 eV, respectively. The Sb 3d”? peaks for Sb,O; and metallic Sb 
bonding occurred at 530.4 eV and 529-530 eV, respectively. Here, Sb metal- 
lic bonding refers to Sb-Ie or Sb-Ge bonds and the formation of Sb clus- 
ters in the film. Before dipping, the peak position for Sb was between the 
Sb homopolar and metallic bonds, so there could be equivalent numbers 
of Sb homopolar and Sb—Te or Sb-Ge bonds in the amorphous NGST film. 
After dipping, the intensities of the Sb 3d% and Sb 3d*? peaks for Sb,O; 
decreased significantly. This indicates that Sb,O, was easily etched by the 
TMAH-based slurry. In the case of tellurium oxide, the Te 3d*? peaks for 
Te metallic bonding and Te oxide bonding occurred at 572.5-574 eV and 
576-577 eV, respectively. The Te 3d”? peaks for Te metallic bonding and 
Te oxide bonding occurred at 583-584 eV and 586-587 eV, respectively. 
As shown in Figure 6.21c, there was no significant difference in the XPS 
spectra for the Te 3d peaks before and after dipping. This indicates that Te 
oxide bonding is stronger than the oxide bonding of Ge and Sb. 

The results suggest a possible CMP mechanism for NGST film. The 
TMAH initially etches off the oxidized film surface (particularly GeO, and 
Sb,O;), and the colloidal silica abrasives then directly contact the unoxi- 
dized NGST film surface to perform mechanical polishing. In addition, 
hardness measurement of the NGST film showed that its hardness (3.2 GPa) 
was approximately three times smaller than that of SiO, film (9 GPa). As a 
result, the colloidal silica abrasives in the TMAH-based slurry significantly 
enhance the polishing rate of NGST film by etching off the oxidized surface 
and applying direct mechanical polishing to the soft NGST film surface. 
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FIGURE 6.21 The XPS spectra of NGST film for (a) Ge 2p, (b) Sb 3d, and (c) 
Te 3d. 
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FIGURE 6.22 The structure of ReRAM. 


6.3.2 Novel CMP for ReRAM 


ReRAM can produce a device by 1T/1R or 1R structure (Figure 6.22). In 
the case of producing by 1T1R structure, after deposition of a resistance 
change material except for CMP for STI, the CMP process is considered. 
ReRam is not sure about its effect regarding contact-area dimension like 
PRAM. If it is not produced by extremely confined structure, the use of 
CMP process depends on the processing choice. 

Recently, in the new ReRAM non-volatile memory, a nickel oxide film 
with titanium (Ti:NiO) was doped to increase voltage for memory erasures. 
Operations required only 5 ns, about 10,000 times faster than before, and 
with resistance fluctuations reduced to 1/10 that of conventional ReR AMs. 
Optimizing the voltage applied to a transistor reduced the current need to 
erase memory to <100 pA. 

The prototype ReRAM device offers low fluctuation of resistance value 
even during high-speed operation. The ReRAM technology “is amena- 
ble to miniaturization and can be manufactured inexpensively,” so it is 
seen as an alternative to flash, the company explained. “If further minute 
non-volatile memory can be realized using ReRAM, there is potential for 
higher performance of mobile devices.” 
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Oxide-to-nitride selectivity, 39, 44 
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in poly Si CMP, 159 
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NAND flash memory images, 155 
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43 
Via bottom, poor adhesion after heat 
treatment, 109 
Viscosity 
effects of electrolyte concentration 
on, 12 
PAA concentration and, 75, 76 
of silica microspheres, 11 
and zeta potential in ILD CMP, 12 
Volatile memory, 149 
Voltage bias conditions, in NAND- 
flash memory cells, 132 


W 


Wafer carrier motion, by CMP tool 
type, 4 

Wafer-level non-uniformity, 22, 23 

Wafer level variation, 23, 24 

Wafer nanotopography, 112 

Wafer surface irregularities, 112 

Wafer-to-wafer non-uniformity 
(WTWNU), 6 


191 


Wafering method, and impact of 
nanotopography on oxide 
CMP, 120-125 
Wax mounted polishing, 121, 123 
WIDNU. See also Within-die non- 
uniformity (WIDNU) 
and removal rate, 31 
Within-die non-uniformity (WIDNU), 
26, 30 
Within-wafer non-uniformity 
(WIWNU), 6, 14, 26 
of ceria slurries, 39 
low in ceria slurries, 47 
in modified vs. unmodified slurry, 
16 
removal rate and, 16 
for SKW1 wafers, 32 
in STI CMP, 44 


X 


XPS analysis, 159 
NGST film, 169 
poli Si and oxide film at pH 10, 158 
XRD powder diffraction pattern, 64 
in ceria slurry, 65 


Z 


Zeta potential, 11 
Cu, TaN films, and colloidal silica 
slurry, 84 
and viscosity in ILD CMP, 12 


